CUTOFF ON ALL RAMANUJAN GRAPHS


EYAL LUBETZKY AND YUVAL PERES 


Abstract. We show that on every Ramanujan graph G, the simple
random walk exhibits cutoff: when G has n vertices and degree d, the
total-variation distance of the walk from the uniform distribution at
time $t = \frac{d}{d-2}\log_{d-1} n + s\sqrt{\log n}$ is asymptotically $\mathbb{P}(Z > c\,s)$ where
Z is a standard normal variable and c = c(d) is an explicit constant.
Furthermore, for all $1 \le p \le \infty$, d-regular Ramanujan graphs minimize
the asymptotic $L^p$-mixing time for SRW among all d-regular graphs.

Our proof also shows that, for every vertex x in G as above, its distance
from $n - o(n)$ of the vertices is asymptotically $\log_{d-1} n$.

1. Introduction 

A family of d-regular graphs $G_n$ with $d \ge 3$ fixed is called an expander,
following the works of Alon and Milman [4,6] from the 1980's, if all nontrivial
eigenvalues of the adjacency matrices are uniformly bounded away from d.
Lubotzky, Phillips, and Sarnak [25] defined a connected d-regular graph G
with $d \ge 3$ to be Ramanujan iff every eigenvalue $\lambda$ of its adjacency matrix
is either $\pm d$ or satisfies $|\lambda| \le 2\sqrt{d-1}$. Such expanders, which in light of
the Alon-Boppana Theorem [29] have an asymptotically optimal spectral
gap, were first constructed, using deep number theoretic tools, in [25] and
independently by Margulis [28] (see also [13,24] and Fig. 1). Due to their
remarkable expansion properties, Ramanujan graphs have found numerous
applications (cf. [19] and the references therein). However, after 25 years of
study, the geometry of these objects is still mysterious, and in particular,
determining the profile of distances between vertices in such a graph and
the precise mixing time of simple random walk (SRW) remained open.
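
As a small illustration of the Ramanujan condition, the following sketch tests it on a list of adjacency eigenvalues. The helper names (`is_ramanujan`, `prism_spectrum`) are hypothetical and not from the paper. The spectra used are the standard ones for the Petersen graph (3, 1 with multiplicity 5, and -2 with multiplicity 4) and for the prism $C_m \times K_2$ (a cycle eigenvalue plus or minus 1); connectivity is assumed rather than checked.

```python
import math

def is_ramanujan(eigenvalues, d, tol=1e-9):
    """Check that every eigenvalue is +-d or has modulus at most 2*sqrt(d-1)."""
    bound = 2 * math.sqrt(d - 1)
    for lam in eigenvalues:
        trivial = abs(abs(lam) - d) <= tol
        if not trivial and abs(lam) > bound + tol:
            return False
    return True

# Known spectrum of the (3-regular) Petersen graph.
petersen_spectrum = [3] + [1] * 5 + [-2] * 4

def prism_spectrum(m):
    """Spectrum of the 3-regular prism C_m x K_2: 2cos(2*pi*k/m) +- 1."""
    return [2 * math.cos(2 * math.pi * k / m) + s
            for k in range(m) for s in (1, -1)]

print(is_ramanujan(petersen_spectrum, 3))   # nontrivial eigenvalues 1 and -2
print(is_ramanujan(prism_spectrum(20), 3))  # long prisms violate the bound
```

The long prism fails because its second eigenvalue, 2cos(2π/20) + 1, exceeds 2√2; this is only meant to show what the bound rules out.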

Formally, letting $\|\mu - \nu\|_{\mathrm{TV}} = \sup_A [\mu(A) - \nu(A)]$ denote total-variation
distance, the ($L^1$) mixing time of a finite Markov chain with transition kernel
P and stationary distribution $\pi$ is defined as

$$t_{\mathrm{mix}}(\varepsilon) = \min\{t : D_{\mathrm{TV}}(t) \le \varepsilon\} \quad\text{where}\quad D_{\mathrm{TV}}(t) = \max_x \|P^t(x,\cdot) - \pi\|_{\mathrm{TV}}\,.$$

A sequence of finite ergodic Markov chains is said to exhibit cutoff if its 
total-variation distance from stationarity drops abruptly, over a period of 
time referred to as the cutoff window, from near 1 to near 0; that is, there 
is cutoff iff $t_{\mathrm{mix}}(\varepsilon) = (1+o(1))\,t_{\mathrm{mix}}(\varepsilon')$ for any fixed $0 < \varepsilon, \varepsilon' < 1$.
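
To make the definitions of $D_{\mathrm{TV}}(t)$ and $t_{\mathrm{mix}}(\varepsilon)$ concrete, here is a minimal sketch that evaluates them exactly on one small Ramanujan graph, the Petersen graph, by pushing the distribution of the walk forward one step at a time. The function names are illustrative choices, and a single 10-vertex graph of course says nothing about cutoff, which is a statement about sequences of graphs.

```python
from itertools import combinations

def petersen_graph():
    """Petersen graph as a Kneser graph: 2-subsets of {0..4}, adjacent iff disjoint."""
    verts = [frozenset(c) for c in combinations(range(5), 2)]
    return {v: [w for w in verts if not (v & w)] for v in verts}

def srw_step(mu, adj):
    """Apply one step of simple random walk to the distribution mu."""
    new = dict.fromkeys(adj, 0.0)
    for v, mass in mu.items():
        for w in adj[v]:
            new[w] += mass / len(adj[v])
    return new

def d_tv(adj, t):
    """Worst-case total-variation distance from uniform after t steps."""
    n = len(adj)
    worst = 0.0
    for x in adj:
        mu = dict.fromkeys(adj, 0.0)
        mu[x] = 1.0
        for _ in range(t):
            mu = srw_step(mu, adj)
        worst = max(worst, 0.5 * sum(abs(m - 1.0 / n) for m in mu.values()))
    return worst

def t_mix(adj, eps, t_max=10_000):
    """Smallest t with d_tv(adj, t) <= eps."""
    for t in range(t_max + 1):
        if d_tv(adj, t) <= eps:
            return t
    raise ValueError("not mixed within t_max steps")

adj = petersen_graph()
print([round(d_tv(adj, t), 4) for t in range(6)])
```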

Our main result shows that the Ramanujan assumption implies cutoff
with $t_{\mathrm{mix}}(\varepsilon) = \big(\frac{d}{d-2} + o(1)\big)\log_{d-1} n$ and window $O(\sqrt{\log n})$.

Figure 1. A ball of radius 4 in the Lubotzky-Phillips-Sarnak
6-regular Ramanujan graph on n = 12180 vertices via $\mathrm{PSL}(2,\mathbb{F}_{29})$.

Theorem 1. On any sequence of d-regular non-bipartite Ramanujan graphs,
SRW exhibits cutoff. More precisely, let G be such a graph on n vertices and

$$t_\star(n) := \frac{d}{d-2}\log_{d-1} n\,.$$

Then for every fixed $s \in \mathbb{R}$ and every initial vertex x, the SRW satisfies

$$D_{\mathrm{TV}}\big(t_\star + s\sqrt{\log_{d-1} n}\big) \to \mathbb{P}(Z > c_d\, s) \quad\text{as } n \to \infty\,, \tag{1.1}$$

where Z is a standard normal random variable and $c_d = \frac{(d-2)^{3/2}}{2\sqrt{d(d-1)}}$.
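
The quantities in Theorem 1 are easy to evaluate numerically. The sketch below computes the cutoff location $t_\star(n)$ and the limiting profile $\mathbb{P}(Z > c_d s)$, taking $c_d = (d-2)^{3/2}/(2\sqrt{d(d-1)})$, which is the value obtained from the CLT normalization (2.2) at time $t_\star$; the function names are hypothetical.

```python
import math

def t_star(n, d):
    """Cutoff location d/(d-2) * log_{d-1}(n) from Theorem 1."""
    return d / (d - 2) * math.log(n, d - 1)

def c_d(d):
    """Constant c_d = (d-2)^{3/2} / (2*sqrt(d(d-1))) (assumed form, see lead-in)."""
    return (d - 2) ** 1.5 / (2 * math.sqrt(d * (d - 1)))

def limiting_tv(s, d):
    """Limit P(Z > c_d * s) of the total-variation distance at t_star + s*sqrt(log_{d-1} n)."""
    return 0.5 * math.erfc(c_d(d) * s / math.sqrt(2))

# Example: the 6-regular LPS graph of Fig. 1 has n = 12180 vertices.
print(t_star(12180, 6))
print([round(limiting_tv(s, 6), 3) for s in (-4, -2, 0, 2, 4)])
```

The profile decreases from near 1 to near 0 as s increases, and equals 1/2 at s = 0, which is the shape of the cutoff window described by (1.1).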

Consequently, we obtain that the profile of graph distances from every
vertex x in a d-regular Ramanujan graph G concentrates on $\log_{d-1} n$ (the
minimum possible value it can concentrate on in a d-regular graph).

Corollary 2. Let G be a d-regular Ramanujan graph on n vertices. Then
for every vertex x in G,

$$\#\{y : |\operatorname{dist}(x,y) - \log_{d-1} n| > 3\log_{d-1}\log n\} = o(n)\,,$$

and furthermore, for all except o(n) vertices y there is a nonbacktracking
cycle¹ through x, y of length at most $2\log_{d-1} n + 6\log_{d-1}\log n$.
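
The set counted in Corollary 2 can be computed for any concrete graph by breadth-first search. The sketch below is a generic helper, with hypothetical names, shown on the 10-vertex Petersen graph purely as a sanity check; the corollary itself is asymptotic in n.

```python
from collections import Counter, deque
from itertools import combinations

def petersen_graph():
    """Petersen graph: 2-subsets of {0..4}, adjacent iff disjoint."""
    verts = [frozenset(c) for c in combinations(range(5), 2)]
    return {v: [w for w in verts if not (v & w)] for v in verts}

def distances_from(adj, x):
    """Graph distances from x to every reachable vertex (BFS)."""
    dist = {x: 0}
    queue = deque([x])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist

def distance_profile(adj, x):
    """Number of vertices at each distance from x."""
    return Counter(distances_from(adj, x).values())

def count_far(adj, x, center, radius):
    """#{y : |dist(x, y) - center| > radius}, the quantity in Corollary 2."""
    return sum(1 for r in distances_from(adj, x).values()
               if abs(r - center) > radius)

adj = petersen_graph()
x = next(iter(adj))
print(dict(distance_profile(adj, x)))
```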

More can be said about high-girth Ramanujan graphs, e.g., the bipartite
LPS expanders whose girth is asymptotically $\frac43\log_{d-1} n$ (see, e.g., [24, §7]).

Corollary 3. Let G be a d-regular Ramanujan graph with n vertices and
girth g, and set $R = \lceil \log_{d-1} n + 5\log_{d-1}\log n \rceil$. For every $k < g - R$ and
simple path $(x_i)_{i=1}^k$ in G, for all but a o(1)-fraction of simple paths $(y_i)_{i=1}^k$
in G, there are vertex-disjoint paths of length R from $x_i$ to $y_i$ for all i.

¹A nonbacktracking cycle is a sequence of adjacent vertices $v_0,\ldots,v_k$ such that $v_0 = v_k$
and $v_i \ne v_{(i+2) \bmod k}$ for all i.
Figure 2. Distance of SRW from equilibrium in $L^1$ (blue) and $L^2$
(red, capped at 1). On left, the LPS graph on $\mathrm{PSL}(2,\mathbb{F}_{29})$ shown in
Fig. 1; on right, asymptotics via Theorem 1 and Proposition 6.




Figure 3. The analogue of Fig. 2 for the NBRW (see §1.4). 

That Corollaries 2-3 also cover bipartite Ramanujan graphs (recently
shown to exist for every degree $d \ge 3$ in [27]) follows from an extension of
the proof of Theorem 1 to the bipartite setting (see Corollary 3.9). Moreover,
it extends to the case where the graph G is weakly Ramanujan (see §1.2).

1.1. Background and related work. The cutoff phenomenon was first 
identified in pioneering studies of Diaconis, Shahshahani and Aldous [1,2,14] 
in the early 1980’s, and while believed to be widespread, rigorous examples 
where it was confirmed were scarce. In view of the canonical example where 
there is no cutoff—SRW on a cycle—and the fact that a necessary condition 
for any reversible Markov chain to have cutoff is for its inverse spectral-gap to
be negligible compared to its mixing time, the second author conjectured [30] 
in 2004 that on every transitive expander SRW has cutoff. 

Durrett [15, §6] conjectured in 2008 that the random walk should have 
cutoff on a uniformly chosen d-regular graph on n vertices (typically a good 
expander) with probability tending to 1 as $n \to \infty$; indeed this is the case, as
was verified by the first author and Sly [22] in 2010. Subsequently, expanders
without cutoff were constructed in [23], but these were highly asymmetric. 
The conjectured behavior of cutoff for all transitive expanders was reiterated 
in the latter works (see [22, Conjecture 6.1] and [23, §3]), yet this was not 
verified nor refuted on any single example to date. 
As a special case, Theorem 1 confirms cutoff on all transitive Ramanujan
graphs—in particular for the Lubotzky-Phillips-Sarnak graphs (see Fig. 2). 

The concentration of measure phenomenon in expanders, discovered by
Alon and Milman [6], implies that the distance from a prescribed vertex
is concentrated up to an O(1)-window. Formally, for every sequence of
expander graphs $G_n$ on n vertices and vertex $x \in V(G_n)$ there exists a
sequence $m_{n,x}$ and constants a, C > 0 so that, for every r > 0,

$$\#\{y \in V(G_n) : |\operatorname{dist}_{G_n}(x,y) - m_{n,x}| > r\} \le C a^{-r} n\,. \tag{1.2}$$

Corollary 2 shows that $m_{n,x} = \log_{d-1} n + O(\log\log n)$ for Ramanujan graphs.

As for the diameter, Alon and Milman ([6, Theorem 2.7]) showed that
$\operatorname{diam}(G) \le 2\sqrt{2d/(d-\lambda)}\,\log_2 n$ for every d-regular graph G on n vertices
where all nontrivial eigenvalues are at most $\lambda$ in absolute value. This bound
was improved to $\lceil \log_{d/\lambda}(n-1) \rceil$ by Chung [11, Theorem 1], and then to
$\big\lfloor \frac{\cosh^{-1}(n-1)}{\cosh^{-1}(d/\lambda)} \big\rfloor + 1$ in [12] using properties of $T_k(x)$, the Chebyshev polynomials
of the first kind. Since $\cosh(\frac12\log(d-1)) = d/(2\sqrt{d-1})$ for any d, this bound
translates to $2\log_{d-1} n + O(1)$ for Ramanujan graphs, and remains the best
known upper bound on the diameter of the LPS expanders (for which this
was proved directly in [25] via the polynomials $T_k(x)$ as later used in [12]).
Corollary 2 implies this asymptotically for every Ramanujan graph: as the
distance from any vertex x to most of the vertices is $(1+o(1))\log_{d-1} n$,
the distance between any two vertices x, y is at most $(2+o(1))\log_{d-1} n$.
Moreover, one can deduce that for every two vertices x, y and every integer
$\ell \ge (2+o(1))\log_{d-1} n$, there exists a path of length exactly $\ell$ between x, y.

A new impetus for understanding distances in Ramanujan graphs is due 
to their role as building blocks in quantum computing; see the influential 
letter by P. Sarnak [32]. Some of Sarnak’s ideas were developed further 
by his student N.T. Sardari in an insightful paper [31] posted to the arXiv 
a few months after the initial posting of the present paper. For a certain 
infinite family of (p + 1)-regular n-vertex Ramanujan graphs, Sardari [31]
shows that the diameter is at least $\lfloor \frac43\log_p(n) \rfloor$ and also gives an alternative
proof of the first part of Corollary 2. 

1.2. Extensions. A sequence of connected d-regular graphs ($d \ge 3$ fixed)
$G_n$ on n vertices is called weakly Ramanujan if, for some $\delta_n = o(1)$ as
$n \to \infty$, every eigenvalue $\lambda$ of $G_n$ is either $\pm d$ or has $|\lambda| \le 2\sqrt{d-1} + \delta_n$.

Theorem 4. On any sequence of d-regular non-bipartite weakly Ramanujan
graphs, SRW exhibits cutoff. More precisely, if $G_n$ is such a graph on n
vertices then for every initial vertex x, the SRW has

$$t_{\mathrm{mix}}(\varepsilon) = \Big(\frac{d}{d-2} + o(1)\Big)\log_{d-1} n \quad\text{for every fixed } 0 < \varepsilon < 1\,.$$
Corollary 5. Let $G_n$ be a d-regular weakly Ramanujan sequence of graphs
on n vertices. Then for every vertex x in $G_n$,

$$\#\Big\{y : \Big|\frac{\operatorname{dist}(x,y)}{\log_{d-1} n} - 1\Big| > \varepsilon\Big\} = o(n) \quad\text{for every fixed } \varepsilon > 0\,.$$


Remark. The weakly Ramanujan hypothesis in Theorem 4 and Corollary 5
may be relaxed to allow some exceptional eigenvalues; for instance, we can
allow eigenvalues $\lambda$ to only satisfy $|\lambda| < d - \varepsilon'$ for some $\varepsilon' > 0$ fixed.


By the result of Friedman [17] that a random (uniformly chosen) d-regular 
graph on n vertices is typically weakly Ramanujan (as conjectured by Alon), 
Theorem 4 then implies cutoff, re-deriving the above mentioned result of [22].

More generally, for two graphs F and G, a covering map $\phi : V(G) \to V(F)$
is a graph homomorphism that, for every $x \in V(G)$, induces a bijection
between the edges incident to x and those incident to $\phi(x)$. If such a map
exists, we say G is a lift (or a cover) of F; a random n-lift of F is a uniformly
chosen lift out of all those with cover number n (i.e., $|\phi^{-1}(x)| = n$ for all x).
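
A random n-lift is straightforward to sample. The sketch below uses the usual permutation model, which is an assumption on my part about how "uniformly chosen" is realized: every base edge receives an independent uniform permutation of {0, ..., n-1} that matches the two fibers. The function name and the vertex labelling `(v, i)` are hypothetical.

```python
import random

def random_lift(base_edges, num_base_vertices, n, seed=0):
    """Sample an n-lift of a simple base graph.

    Vertices of the lift are pairs (v, i) with v a base vertex and i in
    range(n); the covering map is (v, i) -> v. Each base edge {u, v} is
    lifted to a perfect matching between the fibers of u and v.
    """
    rng = random.Random(seed)
    adj = {(v, i): [] for v in range(num_base_vertices) for i in range(n)}
    for u, v in base_edges:
        perm = list(range(n))
        rng.shuffle(perm)
        for i in range(n):
            adj[(u, i)].append((v, perm[i]))
            adj[(v, perm[i])].append((u, i))
    return adj

# Example: a 5-lift of the complete graph K_4 (3-regular on 20 vertices).
k4_edges = [(u, v) for u in range(4) for v in range(u + 1, 4)]
lift = random_lift(k4_edges, 4, 5, seed=1)
print(len(lift), {len(nbrs) for nbrs in lift.values()})
```

By construction the neighbors of `(v, i)` project bijectively onto the neighbors of v in the base graph, which is exactly the covering property in the definition above.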

Friedman and Kohler [18] recently proved (see also [9, Corollary 20] by
Bordenave) that for every fixed d-regular base graph F and $\delta > 0$, if G is
a random n-lift of F then typically all of its "new" eigenvalues (those not
inherited from F via pullback) are at most $2\sqrt{d-1} + \delta$. By the remark
above, Theorem 4 and Corollary 5 apply here (for any fixed regular F).


1.3. Cutoff in $L^p$-distance. Theorem 1 showed that Ramanujan graphs
have an optimal $t_{\mathrm{mix}}$ for SRW: the total-variation distance from (1.1) matches
a lower bound valid for every d-regular graph on n vertices (Fact 2.1 in §2).
It turns out that Ramanujan graphs are extremal for $L^p$-mixing for all p > 1.

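For $1 \le p \le \infty$, the $L^p$-mixing time of a Markov chain with transition
kernel P from its stationary distribution $\pi$ is defined as

$$t_{\mathrm{mix}}^{(L^p)}(\varepsilon) = \min\{t : D_p(t) \le \varepsilon\} \quad\text{where}\quad D_p(t) = \max_x \Big\|\frac{P^t(x,\cdot)}{\pi} - 1\Big\|_{L^p(\pi)}$$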


(note that p = 1 measures total-variation mixing since $D_1(t) = 2D_{\mathrm{TV}}(t)$,
whereas the $L^2$-distance $D_2(t)$ is also known as the chi-square distance).
Chen and Saloff-Coste [10, Theorem 1.5] showed that a lazy random walk
on a family of expander graphs exhibits $L^p$-cutoff, at some unknown location,
for all $p \in (1,\infty]$. (On the notable exception of p = 1, it is said in [10] that
there "the question is more subtle and no good general answer is known.")

The following theorem gives a lower bound on $t_{\mathrm{mix}}^{(L^p)}$ for SRW on a d-regular
graph, asymptotically achieved by Ramanujan graphs for all $p \in (1,\infty]$.
Figure 4. Normalized eigenvalues of Ramanujan graphs on $n \approx 10^4$
vertices: the 6-regular LPS expanders on $\mathrm{PSL}(2,\mathbb{F}_q)$ for q = 29
(front; every nontrivial eigenvalue has multiplicity at least $\frac{q-1}{2}$) and
a 1000-lift of the 3-regular Petersen graph (back).


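Proposition 6. Fix $d \ge 3$ and let $\rho = 2\sqrt{d-1}/d$. Then for all connected
d-regular graphs G on n vertices and every fixed $\varepsilon > 0$, the SRW satisfies

$$t_{\mathrm{mix}}^{(L^p)}(\varepsilon) \ge \begin{cases} c_{d,p}\log_{d-1} n - O(\log\log n) & \text{if } p \in (1,2] \\ \frac{p-1}{p}\log_{1/\rho} n - O(\log\log n) & \text{if } p \in [2,\infty] \end{cases} \tag{1.3}$$

where $c_{d,p} = \big[2\beta - 1 + \frac{p}{p-1} H_{d-1}\big(\beta \,\|\, \frac{d-1}{d}\big)\big]^{-1}$ for $\beta = [1 + (d-1)^{(p-2)/p}]^{-1}$, and
$H_b(\beta \,\|\, \alpha) = \beta\log_b\big(\frac{\beta}{\alpha}\big) + (1-\beta)\log_b\big(\frac{1-\beta}{1-\alpha}\big)$ is the relative entropy function.
Furthermore, if G is non-bipartite Ramanujan then, with the same notation,

$$t_{\mathrm{mix}}^{(L^p)}(\varepsilon) \le \begin{cases} c_{d,p}\log_{d-1} n + O(\log\log n) & \text{if } p \in (1,2] \\ \frac{p-1}{p}\log_{1/\rho} n + O(1) & \text{if } p \in [2,\infty] \end{cases} \tag{1.4}$$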
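
The leading constant in Proposition 6 can be tabulated directly. The sketch below returns the coefficient of $\log_{d-1} n$ in the $L^p$-cutoff location, under the assumption that the branch for $p \ge 2$ is $\frac{p-1}{p}\log_{1/\rho} n$ (the location implied by the bound $D_p(t) \le n^{(p-1)/p}\rho^t$ used in the proof) and that $c_{d,p}$ has the form $[2\beta - 1 + \frac{p}{p-1}H_{d-1}(\beta \,\|\, \frac{d-1}{d})]^{-1}$. The function names are hypothetical.

```python
import math

def rel_entropy(beta, alpha, base):
    """Relative entropy H_b(beta || alpha) with logarithms to the given base."""
    total = 0.0
    if beta > 0:
        total += beta * math.log(beta / alpha, base)
    if beta < 1:
        total += (1 - beta) * math.log((1 - beta) / (1 - alpha), base)
    return total

def lp_cutoff_coefficient(d, p):
    """Coefficient of log_{d-1}(n) in the L^p-cutoff location, for p > 1."""
    rho = 2 * math.sqrt(d - 1) / d
    if p >= 2:  # includes p = math.inf
        frac = 1.0 if math.isinf(p) else (p - 1) / p
        # (p-1)/p * log_{1/rho} n, rewritten in units of log_{d-1} n
        return frac / math.log(1 / rho, d - 1)
    beta = 1 / (1 + (d - 1) ** ((p - 2) / p))
    entropy = rel_entropy(beta, (d - 1) / d, d - 1)
    return 1 / (2 * beta - 1 + p / (p - 1) * entropy)

for p in (1.001, 1.5, 2, 4, math.inf):
    print(p, round(lp_cutoff_coefficient(3, p), 4))
```

As p decreases to 1 the coefficient approaches d/(d-2), matching the total-variation cutoff location of Theorem 1, and the two branches agree at p = 2.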


1.4. Method of proof. The natural route to exploit spectral details on
the transition kernel P for an upper bound on the $L^1$-distance from the
stationary distribution $\pi$ is via the $L^2$-distance (see, e.g., [19, Theorem 3.2]).
However, this fails to give the sought bound $\frac{d}{d-2}\log_{d-1} n$ for the SRW, as
we see from Proposition 6 that the SRW on Ramanujan graphs exhibits an
$L^2$-cutoff at $\frac12\log_{1/\rho} n > (1+\eta)\frac{d}{d-2}\log_{d-1} n$ for some $\eta(d) > 0$ (see (2.7)).

To remedy this, we turn to the nonbacktracking random walk (NBRW), 
which moves from the directed edge (x, y) to a uniformly chosen edge {y, z) 
such that $z \ne x$. In recent years, delicate spectral information on random
graphs has been extracted by counting nonbacktracking paths; notably, this 
was essential in the proofs that random d-regular graphs and random lifts 
are weakly Ramanujan [9, 17,18]. Here we follow the reverse route, and use 
spectral information on the graph to control the nonbacktracking paths. 


























































































Figure 5. Eigenvalues of the nonbacktracking operator B of the
graphs from Fig. 4 (with the LPS expander on the left). Colors of
chords between pairs of eigenvalues $\theta, \theta'$ depict the inner product of
their corresponding eigenvectors $w, w'$ (blue near 1).

The known relation between the spectrum of G and the spectrum of
the nonbacktracking operator B implies that if G is Ramanujan, each of
its nontrivial eigenvalues $\lambda$ is mapped to eigenvalues $\theta, \theta' \in \mathbb{C}$ of B with
modulus $\sqrt{d-1}$ (see Fig. 4-5 showing this effect for two Ramanujan graphs
with drastically different spectral features). For intuition, note that, had
the operator B been self-adjoint and transitive (it is neither), we would
have gotten that the $L^2$-distance at time t is $O(\sqrt{n}\,(d-1)^{-t/2})$, implying
the correct upper bound of $(1+o(1))\log_{d-1} n$ for the NBRW.

Fortunately, it turns out that, while not a normal operator, B is unitarily
similar to a matrix $\Lambda$ that is block-diagonal with n - 1 non-singleton blocks
(n - 2 if G is bipartite), each of which has size 2 x 2 (despite potential high
multiplicities in the eigenvalues of G) and corresponds to an eigenvector
pair $w, w'$ with matching eigenvalues $\theta, \theta'$. This description of B appears in
Proposition 3.1 and may be of independent interest.

1.5. Organization. The rest of this paper is organized as follows. Section 2
describes the reduction of $L^1$-mixing for the SRW to that of the NBRW and
establishes the optimality of the $L^p$-cutoff of SRW on Ramanujan graphs for
all p > 1 (Proposition 6). Section 3 studies the NBRW, beginning in §3.1
with the aforementioned spectral decomposition and its properties (an exact 
computation of the off-diagonal entries is deferred to Proposition 4.1 in §4). 
In §3.2 we give the proof of the non-bipartite case, which implies Theorem 1 
and Corollary 2; and §3.3 includes the proofs of the extensions to bipartite 
and weakly Ramanujan graphs, which imply Theorem 4 and Corollary 5. 
2. Simple random walk


2.1. Reduction to NBRW. As described in [22] (see §2.3 and §5.2 there),
cutoff for SRW can be reduced to cutoff for the NBRW as follows. Let
G = (V, E) be a d-regular graph and let $\mathbb{T}_d$ be the infinite d-regular tree
rooted at $\xi$, the universal cover of G. For a given vertex $x \in V$, consider a
cover map $\phi : \mathbb{T}_d \to V$ with $\phi(\xi) = x$, and observe that if $(\mathcal{X}_t)$ is SRW on
$\mathbb{T}_d$ started at $\xi$ then $X_t = \phi(\mathcal{X}_t)$ is SRW on G started at x. (This was also used
in the proof of the Alon-Boppana Theorem given in [25, Proposition 4.2].)
Similarly, if $\mathcal{Y}_t$ is NBRW on $\mathbb{T}_d$ started at $(\xi,\sigma) \in \vec{E}(\mathbb{T}_d)$, and we write
$\mathcal{Y}_t = (\mathcal{Y}_t', \mathcal{Y}_t'')$ to denote its endpoint vertices, then $Y_t = (Y_t', Y_t'')$ given
by $Y_t' = \phi(\mathcal{Y}_t')$ and $Y_t'' = \phi(\mathcal{Y}_t'')$ is NBRW on G started at $(x, \phi(\sigma))$. By
symmetry, if

$$S_{t,\ell} := \{\operatorname{dist}(\xi, \mathcal{X}_t) = \ell\}\,,$$

then the conditional distribution of $\mathcal{X}_t$ given $S_{t,\ell}$ is uniform over the vertices
at distance $\ell$ from $\xi$ in $\mathbb{T}_d$. Therefore,

$$\mathbb{P}_x(X_t \in \cdot \mid S_{t,\ell}) = \frac{1}{d}\sum_{\sigma \,:\, \xi\sigma \in E(\mathbb{T}_d)} \mathbb{P}_{(x,\phi(\sigma))}\big(Y''_{\ell-1} \in \cdot\big)\,.$$

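As a projection can only decrease total-variation distance, letting $\ell = t_{\mathrm{mix}}(\varepsilon)$
for the NBRW on G and $\pi$ be the uniform distribution over V(G), we get

$$\|\mathbb{P}_x(X_t \in \cdot) - \pi\|_{\mathrm{TV}} \le \varepsilon + \mathbb{P}(\operatorname{dist}(\xi, \mathcal{X}_t) < \ell)\,,$$

and in particular, taking a maximum over x shows that the SRW on G has

$$D_{\mathrm{TV}}(t) \le \varepsilon + \mathbb{P}(\operatorname{dist}(\xi, \mathcal{X}_t) < \ell)\,. \tag{2.1}$$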

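Finally, since SRW on $\mathbb{T}_d$ ($d \ge 3$) is transient, $\mathcal{X}_t$ returns to $\xi$ only a finite
number of times almost surely. If $\mathcal{X}_t \ne \xi$ then $\operatorname{dist}(\mathcal{X}_{t+1},\xi) - \operatorname{dist}(\mathcal{X}_t,\xi)$ is
equal to -1 with probability 1/d and +1 otherwise. Therefore, by the CLT,

$$\frac{\operatorname{dist}(\mathcal{X}_t,\xi) - \frac{d-2}{d}\,t}{\frac{2\sqrt{d-1}}{d}\sqrt{t}} \Rightarrow \mathcal{N}(0,1)\,. \tag{2.2}$$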
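
The centering and scaling in the CLT (2.2) can be checked without simulation, since the distance of SRW on the tree from its starting point is a birth-death chain: from 0 it moves to 1, and from k ≥ 1 it moves down with probability 1/d and up otherwise. The sketch below (hypothetical function names) computes the exact law of the distance after t steps and compares its mean and variance with the drift (d-2)/d and the per-step variance 4(d-1)/d² appearing in (2.2).

```python
def distance_law(d, t):
    """Exact law of dist(X_t, root) for SRW on the infinite d-regular tree."""
    law = [0.0] * (t + 2)
    law[0] = 1.0
    for _ in range(t):
        new = [0.0] * (t + 2)
        new[1] += law[0]                        # at the root every step moves away
        for k in range(1, t + 1):
            new[k - 1] += law[k] / d            # one neighbor is closer to the root
            new[k + 1] += law[k] * (d - 1) / d  # the other d-1 are farther
        law = new
    return law[: t + 1]

def mean_and_variance(law):
    mean = sum(k * q for k, q in enumerate(law))
    var = sum((k - mean) ** 2 * q for k, q in enumerate(law))
    return mean, var

d, t = 3, 1000
mean, var = mean_and_variance(distance_law(d, t))
print(mean / t, (d - 2) / d)          # empirical vs. predicted drift
print(var / t, 4 * (d - 1) / d ** 2)  # empirical vs. predicted variance rate
```

The small discrepancies are of order 1/t and come from the finitely many visits to the root, consistent with the transience argument above.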


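Thus, if $\ell \to \infty$ then by (2.1), for every fixed s > 0 the SRW on G satisfies

$$\limsup_{n\to\infty} D_{\mathrm{TV}}\Big(\frac{d}{d-2}\,\ell + s\sqrt{\ell}\Big) \le \varepsilon + \mathbb{P}(Z > c_d\, s) \tag{2.3}$$

where Z is a standard normal random variable and $c_d = \frac{(d-2)^{3/2}}{2\sqrt{d(d-1)}}$.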

Conversely, the number of vertices at distance $\ell$ from a given vertex $x \in V$
is at most $d(d-1)^{\ell-1}$. So, on the event $\operatorname{dist}(\mathcal{X}_t,\xi) < \log_{d-1}(\varepsilon n/d)$, the SRW
$X_t$ is confined to a set of at most $\varepsilon n$ vertices of G, thus its total-variation
distance from $\pi$ is at least $1 - \varepsilon$. Altogether, (2.2) implies the following.
Figure 6. The $L^p$-distance ($p \ge 1$) from equilibrium of SRW on the
LPS graph on $\mathrm{PSL}(2,\mathbb{F}_{29})$ shown in Fig. 1, highlighting $p = 1, 2, \infty$.

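Fact 2.1. For every d-regular graph G on n vertices with $d \ge 3$ fixed, and
every fixed $s, \varepsilon > 0$, the SRW on G satisfies

$$\liminf_{n\to\infty} D_{\mathrm{TV}}\big(t - s\sqrt{\log_{d-1} n}\big) \ge 1 - \varepsilon - \mathbb{P}(Z > c_d\, s) \tag{2.4}$$

for $t = \frac{d}{d-2}\log_{d-1}(\varepsilon n/d)$, where $c_d = \frac{(d-2)^{3/2}}{2\sqrt{d(d-1)}}$ and $Z \sim \mathcal{N}(0,1)$.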

Comparing the two bounds (2.3)-(2.4) with the desired estimate (1.1) for
the SRW in Theorem 1, we see that the latter will follow if we show that the
NBRW has cutoff at time $\log_{d-1} n + o(\sqrt{\log n})$ with window $o(\sqrt{\log n})$. This
will be achieved in §3 via a spectral analysis of the nonbacktracking walk.

2.2. Optimal $L^p$-mixing on Ramanujan graphs. We begin with the
special case p = 2 of Proposition 6.

Lemma 2.2. Fix $d \ge 3$ and let $\rho = 2\sqrt{d-1}/d$. For every fixed $\varepsilon > 0$ and
every connected d-regular graph G on n vertices, the SRW satisfies

$$t_{\mathrm{mix}}^{(L^2)}(\varepsilon) \ge \tfrac12\log_{1/\rho} n - O(\log\log n)\,.$$

Moreover, if G is non-bipartite Ramanujan then this is tight: SRW has an
$L^2$-cutoff at $\frac12\log_{1/\rho} n > (1+\eta)\frac{d}{d-2}\log_{d-1} n$ for some constant $\eta = \eta(d) > 0$.

Proof of Lemma 2.2. Let $P^t$ be the t-step transition kernel of SRW, and
let $\pi$ be the uniform distribution on V(G). For any $x \in V(G)$,

$$\sum_y P^t(x,y)^2 = P^{2t}(x,x) \ge Q^{2t}(\xi,\xi)\,,$$

where $Q^t$ is the t-step transition kernel of SRW on $\mathbb{T}_d$, the infinite d-regular
tree rooted at $\xi$; indeed, as argued above, if $\mathcal{X}_t$ is SRW on the cover tree
then $X_t = \phi(\mathcal{X}_t)$ is SRW on G, where $\phi$ is the cover map, and in particular
a return to the root in the former implies a return to the origin in the latter.

The probability $Q^{2t}(\xi,\xi)$ is nothing but the probability of a 1/d-biased
walk, reflected at 0, to be 0 at time 2t, well-known (cf. [35, §5, p128]) to be




2p2 


p 


2t 


2t-2 


2p2 p 


2t 


(2.5)

for $\rho = 2\sqrt{d-1}/d$.


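In particular, using the standard expansion of the $L^2$-distance,

$$\Big\|\frac{\mu}{\pi} - 1\Big\|^2_{L^2(\pi)} = \sum_x \Big|\frac{\mu(x)}{\pi(x)} - 1\Big|^2 \pi(x) = \sum_x \frac{\mu(x)^2}{\pi(x)} - 1\,,$$

which holds for every probability distribution $\mu$, we thus have

$$\Big\|\frac{P^t(x,\cdot)}{\pi} - 1\Big\|^2_{L^2(\pi)} \ge c_d\, n\, \rho^{2t}\, t^{-3/2} - 1 \tag{2.6}$$

for an explicit constant $c_d > 0$ depending only on d. Consequently, from any initial $x \in V$ we have

$$t_{\mathrm{mix}}^{(L^2)}(x,\varepsilon) \ge \tfrac12\log_{1/\rho} n - O(\log\log n)\,,$$

where $t_{\mathrm{mix}}^{(L^2)}(x,\varepsilon)$ is the first t where $\|P^t(x,\cdot)/\pi - 1\|_{L^2(\pi)}$ becomes at most $\varepsilon$.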

We next argue that

$$\tfrac12\log_{1/\rho} n > \frac{d}{d-2}\log_{d-1} n \quad\text{for every real } d \in (2,\infty) \text{ and } n \ge 2\,. \tag{2.7}$$

Indeed, (2.7) is equivalent to having $\frac{d-2}{d}\log(d-1) > 2\log\big(\frac{d}{2\sqrt{d-1}}\big)$ for all
real $d \in (2,\infty)$, which, in turn, immediately follows from the fact that

$$f(d) := \frac{d-2}{d}\log(d-1) - 2\log\Big(\frac{d}{2\sqrt{d-1}}\Big)$$

has $f'(d) = 2d^{-2}\log(d-1)$, so $f(2) = 0$ whereas $f'(d) > 0$ for all $d > 2$.
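
The inequality (2.7) and the derivative computation behind it are easy to sanity-check numerically. The sketch below assumes the function is $f(d) = \frac{d-2}{d}\log(d-1) - 2\log\big(\frac{d}{2\sqrt{d-1}}\big)$ and compares a central finite difference with the closed form $2d^{-2}\log(d-1)$; it also reports the ratio between the $L^2$ location $\frac12\log_{1/\rho} n$ and the $L^1$ location $\frac{d}{d-2}\log_{d-1} n$. All function names are hypothetical.

```python
import math

def f(d):
    """f(d) = (d-2)/d * log(d-1) - 2*log(d / (2*sqrt(d-1)))."""
    return (d - 2) / d * math.log(d - 1) - 2 * math.log(d / (2 * math.sqrt(d - 1)))

def f_prime_closed_form(d):
    return 2 * math.log(d - 1) / d ** 2

def l2_over_l1_location(d):
    """Ratio of (1/2) log_{1/rho} n to d/(d-2) log_{d-1} n (independent of n)."""
    rho = 2 * math.sqrt(d - 1) / d
    l2 = 0.5 / math.log(1 / rho)
    l1 = d / (d - 2) / math.log(d - 1)
    return l2 / l1

h = 1e-5
for d in (3, 4, 6, 12):
    finite_diff = (f(d + h) - f(d - h)) / (2 * h)
    print(d, f(d), finite_diff, f_prime_closed_form(d), l2_over_l1_location(d))
```

The ratio exceeds 1 for every degree tried, which is the gap $1 + \eta(d)$ between the $L^2$ and total-variation cutoff locations.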

Finally, when G is Ramanujan, the sought upper bound on the $L^2$-distance
follows from considering the spectral representation (see, e.g., [3])

$$\Big\|\frac{P^t(x,\cdot)}{\pi} - 1\Big\|^2_{L^2(\pi)} = n\sum_{i=2}^n |f_i(x)|^2 (\lambda_i/d)^{2t} \tag{2.8}$$

for $(f_i)_{i=1}^n$ an orthonormal basis of eigenfunctions with eigenvalues $(\lambda_i)_{i=1}^n$
of the adjacency matrix and $\lambda_1 = d$, and plugging in $|\lambda_i| \le 2\sqrt{d-1}$. ∎


Remark 2.3. A different perspective on Lemma 2.2 is given by the next
proof of a slightly weaker statement. By the generalization by Serre [33]
(see [13, Theorem 1.4.9]) of the Alon-Boppana Theorem [29], for every $\varepsilon > 0$
there exists $c_{\varepsilon,d} > 0$ such that G has at least $c_{\varepsilon,d}\, n$ eigenvalues $\lambda$ with
$|\lambda| \ge 2\sqrt{d-1} - \varepsilon$. Applying this fact for some $\varepsilon(d) > 0$ to be specified
later, since $\frac1n\sum_x \|P^t(x,\cdot)/\pi - 1\|^2_{L^2(\pi)} = \sum_{i=2}^n (\lambda_i/d)^{2t}$ where $\lambda_2,\ldots,\lambda_n$ are the
nontrivial eigenvalues of G (this follows from (2.8) since an average over x
allows one to replace $\sum_x |f_i(x)|^2$ by 1 for each i), we deduce that

$$\max_x \Big\|\frac{P^t(x,\cdot)}{\pi} - 1\Big\|^2_{L^2(\pi)} \ge \frac1n\sum_x \Big\|\frac{P^t(x,\cdot)}{\pi} - 1\Big\|^2_{L^2(\pi)} \ge c_{\varepsilon,d}\, n \Big(\frac{2\sqrt{d-1} - \varepsilon}{d}\Big)^{2t}\,.$$

Consequently,

$$t_{\mathrm{mix}}^{(L^2)}(\delta) \ge \frac{\log(n/\delta)}{2\log\big(\frac{d}{2\sqrt{d-1} - \varepsilon}\big)} - O(1)\,. \tag{2.9}$$

The proof now follows from (2.7) as we may choose $\varepsilon(d), \eta(d) > 0$ so that the
right-hand of (2.9) would be at least $(1 + \eta - o(1))\frac{d}{d-2}\log_{d-1} n$, as needed.

For the general case of $p \in [1,\infty]$, we need the following simple claims.

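Claim 2.4. Let G be a d-regular graph on n vertices, and let $\mathbb{T}_d$ be the
infinite d-regular tree rooted at $\xi$. For every $1 \le p \le \infty$, SRW on G satisfies

$$\|P^t(x,\cdot)/\pi - 1\|_{L^p(\pi)} \ge n^{(p-1)/p}\,\|Q^t(\xi,\cdot)\|_p - 1$$

for all x, t, where P and Q are the transition kernels of SRW on G and $\mathbb{T}_d$.
Proof. By the triangle inequality w.r.t. $\|\cdot\|_{L^p(\pi)}$,

$$\|P^t(x,\cdot)/\pi - 1\|_{L^p(\pi)} \ge n^{(p-1)/p}\,\|P^t(x,\cdot)\|_p - 1\,.$$

Since $P^t(x,y) = \sum_{\upsilon \in \phi^{-1}(y)} Q^t(\xi,\upsilon)$ for every cover map $\phi : V(\mathbb{T}_d) \to V(G)$
with $\phi(\xi) = x$, using the fact $(\sum_{i=1}^m a_i)^p \ge \sum_{i=1}^m a_i^p$ for every $a_1,\ldots,a_m \ge 0$
and $p \ge 1$ gives

$$(P^t(x,y))^p \ge \sum_{\upsilon \in \phi^{-1}(y)} (Q^t(\xi,\upsilon))^p\,.$$

Summing over all y gives $\|P^t(x,\cdot)\|_p \ge \|Q^t(\xi,\cdot)\|_p$, as required. ∎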

Claim 2.5. Fix $d \ge 3$ and let $\mathbb{T}_d$ be the infinite d-regular tree rooted at $\xi$.
There exist constants $c_1(d), c_2(d) > 0$ such that, for all k and t,

^ F^{\Xt\ = k) ^ 

ci( ) _ = ^) ~ 

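where $|X_t|$ is the distance of $X_t$ from its origin $\xi$, and $Z_t \sim \mathrm{Bin}(t, \frac{d-1}{d})$.

Proof. The case k = 0 follows from (2.5), since $\mathbb{P}(Z_{2t} = t) = \binom{2t}{t}\big(\frac{d-1}{d^2}\big)^t$,
which is $(\rho/2)^{2t}\binom{2t}{t}$. This extends to all k using the decomposition

$$\mathbb{P}_\xi(|X_t| = k) = \sum_{\ell=0}^{t-1} \mathbb{P}_\xi(|X_\ell| = 0)\,\mathbb{P}_\xi\big(\{|X_j| > 0 : 1 \le j \le t-\ell\},\ |X_{t-\ell}| = k\big)$$

and the Ballot Theorem (see, e.g., [16, §III.1]). ∎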
Figure 7. $L^p$-cutoff location (normalized by $\log_{d-1} n$) as a function
of $p \ge 1$ for Ramanujan graphs with degree d = 3,..., 12. The
functions are $C^1$, but not $C^2$ at p = 2.


For more general local limit theorems on trees, see, e.g., [21]. 


Proof of Proposition 6. With Claims 2.4 and 2.5 in mind, and using their
notation, for every t and $p \in [1,\infty]$ we have

iio'K. 111?=((■' - = k)y 

k>0 

> = (yt + t)/2))^ . (2.10) 

k>0 


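Writing $\beta t = (k+t)/2$ (so that $k = (2\beta - 1)t$), the large deviation estimate

$$\mathbb{P}(Z_t = \beta t) \asymp t^{-1/2}\exp\big[-H_e\big(\beta \,\|\, \tfrac{d-1}{d}\big)\,t\big] \tag{2.11}$$

for the binomial variable $Z_t$ thus leads to the following optimization problem:

$$\min\Big\{\tfrac{p-1}{p}(2\beta - 1) + H_{d-1}\big(\beta \,\|\, \tfrac{d-1}{d}\big) \;:\; \tfrac12 \le \beta \le 1\Big\}\,. \tag{2.12}$$

(Observe that in fact $\beta \le \frac{d-1}{d}$ since for $\beta > \frac{d-1}{d}$ both terms are increasing.)
Let $f(\beta)$ denote the objective in (2.12). Then

$$f'(\beta) = \frac{2(p-1)}{p} + \log_{d-1}\Big(\frac{\beta}{(1-\beta)(d-1)}\Big)\,,$$

and solving $f'(\beta) = 0$ we get $\frac{1-\beta}{\beta} = (d-1)^{(p-2)/p}$. Since $f''(\beta)$ is positive,
it follows that the minimizer of (2.12) is at

$$\beta^* = \frac{1}{(d-1)^{(p-2)/p} + 1} \vee \frac12\,. \tag{2.13}$$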
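
The closed-form minimizer (2.13) can be compared against a brute-force search. The sketch below assumes the objective of (2.12) is $f(\beta) = \frac{p-1}{p}(2\beta - 1) + H_{d-1}(\beta \,\|\, \frac{d-1}{d})$ on $[\frac12, 1]$ and minimizes it over a fine grid; the function names and the grid resolution are arbitrary choices.

```python
import math

def rel_entropy(beta, alpha, base):
    """H_b(beta || alpha), with the convention 0 * log 0 = 0."""
    total = 0.0
    if beta > 0:
        total += beta * math.log(beta / alpha, base)
    if beta < 1:
        total += (1 - beta) * math.log((1 - beta) / (1 - alpha), base)
    return total

def objective(beta, d, p):
    """Objective of the optimization problem (2.12) (assumed form, see lead-in)."""
    return (p - 1) / p * (2 * beta - 1) + rel_entropy(beta, (d - 1) / d, d - 1)

def beta_star(d, p):
    """Closed-form minimizer from (2.13)."""
    return max(1 / ((d - 1) ** ((p - 2) / p) + 1), 0.5)

def grid_minimizer(d, p, steps=20000):
    """Brute-force minimizer of the objective over a grid on [1/2, 1]."""
    grid = [0.5 + 0.5 * i / steps for i in range(steps + 1)]
    return min(grid, key=lambda b: objective(b, d, p))

for d, p in [(3, 1.5), (3, 3.0), (6, 1.2), (4, 2.0)]:
    print(d, p, beta_star(d, p), grid_minimizer(d, p))
```

For p ≥ 2 the unconstrained critical point falls below 1/2, so the grid search returns the boundary point 1/2, in line with the two regimes of the cutoff location.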


(Observe that $\beta^* = 1/2$ iff $p \ge 2$, hence the two regimes for the $L^p$-cutoff
location as a function of p.) By (2.10)-(2.11), for some c = c(d, p) > 0,

$$\|Q^t(\xi,\cdot)\|_p \ge c\, t^{-3/2}\,(d-1)^{-f(\beta^*)t}\,,$$

and therefore, by Claim 2.4, for every starting vertex x,

$$\Big\|\frac{P^t(x,\cdot)}{\pi} - 1\Big\|_{L^p(\pi)} \ge c\, t^{-3/2}\, n^{(p-1)/p}\,(d-1)^{-f(\beta^*)t} - 1\,. \tag{2.14}$$

This implies (1.3) (and is furthermore valid for every starting vertex x).

For matching upper bounds in case G is a Ramanujan graph, first take
$p \ge 2$. The lower bound established above is $D_p(t) \ge c\, t^{-3/2}\, n^{(p-1)/p}\rho^t - 1$.
Recalling Lemma 2.2, for Ramanujan graphs,

$$D_2(t) \le \sqrt{n}\,\rho^t\,, \qquad D_\infty(t) \le D_2(\lfloor t/2 \rfloor)\, D_2(\lceil t/2 \rceil) \le n\rho^t\,,$$

using the well-known fact (a routine application of Cauchy-Schwarz) that

$$D_\infty(s+t) \le D_2(t)\,\tilde{D}_2(s)\,, \tag{2.15}$$

where $\tilde{D}_2(s)$ corresponds to the reversed chain (here $\tilde{D}_2(s) = D_2(s)$ as SRW
is reversible). So, by the Riesz-Thorin Interpolation Theorem (see, e.g., [34,
Theorem 1.3, p. 179]), for $2 \le p \le \infty$, we deduce that $D_p(t) \le n^{(p-1)/p}\rho^t$.
Having established (1.4) for $p \ge 2$, now take $1 < p < 2$. Let

$$P^t(x,\cdot) = \sum_k \mathbb{P}_\xi(|X_t| = k)\,\mu_k(x,\cdot) \tag{2.16}$$

where $\mu_k$ is the law of the projection of NBRW on the endpoint of its directed
edge, started at a uniform edge originating from x. By Jensen's inequality,

{d{d-l)'^-^)~^^^\\ilkix,-) - k\\p < {d{d - ■) - 

Notice that

$$\Big\|\frac{\mu_k(x,\cdot)}{\pi} - 1\Big\|^2_{L^2(\pi)} \le \max_{y \,:\, xy \in E(G)} \Big\|\frac{p^{k-1}((x,y),\cdot)}{\tilde\pi} - 1\Big\|^2_{L^2(\tilde\pi)}$$

where $p^k$ is the k-step transition kernel of the NBRW and $\tilde\pi$ is its stationary
distribution. In our analysis of the NBRW in §3, we will show (see (3.13))
that the right-hand side of the last display is $O(nk^2(d-1)^{-k})$, whence

\\Pk{x,-)-k\\p<^dk{d-i)’^<^^-p/r 

Recalling (2.16), it now follows that 

\\P\x, •) - k\\p<{t + 1) miax (P^dT)] = k)) \\pkix, ■ 


0<k<t 


ill 

n lip 


< cip maj M - (P4(|A;,| = k)) , 

0<k<t 


which, in view of (2.10), gives rise to the same optimization problem (2.12).
Therefore, the right-hand side of the last display is at most $t^{C}(d-1)^{-f(\beta^*)t}$
for C > 0 fixed. Taking t as in (1.4) with a suitable additive $O(\log\log n)$
term gives $(d-1)^{f(\beta^*)t} \ge n^{(p-1)/p}\, t^{C+1}$. Thus,

$$\|P^t(x,\cdot)/\pi - 1\|_{L^p(\pi)} = n^{(p-1)/p}\,\|P^t(x,\cdot) - \pi\|_p \le t^{-1}\,,$$

establishing (1.4) for all $1 < p < 2$. ∎
3. Nonbacktracking walks

3.1. Spectral decomposition. The spectrum of the nonbacktracking walk 
has been thoroughly studied, in part due to the fact that its eigenvalues are 
precisely the inverse of the poles of the so-called Ihara Zeta function of 
the graph (cf. [8,20]). Our analysis here, on the other hand, hinges on 
the structure of the eigenfunctions, starting with a spectral decomposition 
of the nonbacktracking operator; this builds on properties of this operator 
that appear implicitly in [20] (see also [5,7,8] as well as [26, Exercise 6.59]). 
Proposition 3.1 below gives a more complete picture. 

Throughout this section, for a graph G = (V, E), we denote its adjacency
matrix by A = A(G) and let $\lambda_1 = d \ge \lambda_2 \ge \ldots \ge \lambda_n$ be its eigenvalues.
Denote by $\vec{E}$ the set of N = 2|E| directed edges of G; we refer to undirected
edges as $xy \in E$ and to directed ones as $(x,y) \in \vec{E}$ for the sake of clarity.
The nonbacktracking walk matrix B is the $(\vec{E} \times \vec{E})$-matrix given by

$$B_{(u,v),(x,y)} = \mathbf{1}_{\{v = x,\ u \ne y\}} \quad\text{for } (u,v) \text{ and } (x,y) \text{ in } \vec{E}\,. \tag{3.1}$$

Though B may not be a normal operator, it can be decomposed as follows. 


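Proposition 3.1. Let G = (V, E) be a connected d-regular graph ($d \ge 3$) on
n vertices. Let N = dn and let $(\lambda_i)_{i=1}^n$ be the eigenvalues of the adjacency
matrix, with $\lambda_1 = d$. Then the operator B from (3.1) is unitarily similar to

$$\Lambda = \operatorname{diag}\Bigg(d-1,\ \begin{pmatrix}\theta_2 & \alpha_2 \\ 0 & \theta_2'\end{pmatrix},\ \ldots,\ \begin{pmatrix}\theta_n & \alpha_n \\ 0 & \theta_n'\end{pmatrix},\ \underbrace{-1,\ldots,-1}_{N/2-n},\ \underbrace{1,\ldots,1}_{N/2-n+1}\Bigg) \tag{3.2}$$

where $|\alpha_i| \le 2(d-1)$ for all i and $\theta_i, \theta_i' \in \mathbb{C}$ are defined as the solutions to

$$\theta^2 - \lambda_i\theta + d - 1 = 0\,. \tag{3.3}$$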


Remark 3.2. The exact value of $|\alpha_i|$ is shown in Proposition 4.1 to be d - 2
for every $|\lambda_i| \le 2\sqrt{d-1}$ and $\sqrt{d^2 - \lambda_i^2}$ for every $2\sqrt{d-1} < |\lambda_i| < d$.

Remark 3.3. We see that every eigenvalue $\theta \ne \pm 1$ of B is of the form
$\lambda/2 \pm \sqrt{(\lambda/2)^2 - (d-1)}$ for some eigenvalue $\lambda$ of A (with $\theta = d - 1$ matching
the principal eigenvalue $\lambda = d$). Indeed, this well-known fact follows from
Bass's Formula [8], which in the d-regular case is equivalent to the statement
that $f_B(\theta) = (\theta^2 - 1)^{N/2-n}\,\theta^n f_A(\theta + (d-1)/\theta)$ for $f_A$ and $f_B$ the characteristic
polynomials of A and B, respectively.

(i) $\lambda = d$ corresponds to $\theta = d - 1$, the principal eigenvalue of B matching
the eigenvector $w_1 \equiv 1$; the second solution, $\theta' = 1$, was already
accounted for in (3.2). An eigenvalue of $\lambda = -d$ (when G is bipartite)
yields $\theta = -(d-1)$ and an extra -1 eigenvalue ($N/2 - n + 1$ altogether).

(ii) $2\sqrt{d-1} < |\lambda| < d$ yields two eigenvalues $\theta \ne \theta' \in \mathbb{R}$ of B.

(iii) $|\lambda| < 2\sqrt{d-1}$ yields $\theta = \bar{\theta'} \in \mathbb{C} \setminus \mathbb{R}$ with $|\theta| = \sqrt{d-1}$ (for instance,
$\lambda = 0$ corresponds to $\theta = i\sqrt{d-1}$ and $\theta' = -i\sqrt{d-1}$).

(iv) $\lambda = \pm 2\sqrt{d-1}$ gives a single solution $\theta = \pm\sqrt{d-1}$ with multiplicity 2.
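
The case analysis in Remark 3.3 amounts to solving the quadratic (3.3) for each adjacency eigenvalue. A minimal sketch, with a hypothetical function name, that reproduces cases (i)-(iv) numerically:

```python
import cmath
import math

def nb_eigenvalues(lam, d):
    """Solutions theta, theta' of theta^2 - lam*theta + (d - 1) = 0."""
    disc = cmath.sqrt(lam * lam - 4 * (d - 1))
    return (lam + disc) / 2, (lam - disc) / 2

d = 3
for lam in (d, 2.9, 1, 0, 2 * math.sqrt(d - 1)):
    theta, theta_p = nb_eigenvalues(lam, d)
    print(lam, theta, theta_p, abs(theta), abs(theta_p))
```

For $|\lambda| \le 2\sqrt{d-1}$ both roots have modulus $\sqrt{d-1}$, which is the property of Ramanujan graphs exploited in §1.4; in all cases the roots multiply to d - 1 and sum to $\lambda$.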

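Remark 3.4. For each $\theta \in \mathbb{C}$, define $T_\theta : \ell^2(V) \to \ell^2(\vec{E})$ by

$$(T_\theta f)(x,y) := \theta f(y) - f(x)\,. \tag{3.4}$$

Each solution $\theta \ne \pm 1$ of equation (3.3), for some $\lambda$ such that $Af = \lambda f$, is
an eigenvalue of B corresponding to the eigenvector $T_\theta f$; indeed,

$$(BT_\theta f)(x,y) = \sum_{z \,:\, yz \in E,\ z \ne x} \big(\theta f(z) - f(y)\big) = \theta\big[(Af)(y) - f(x)\big] - (d-1)f(y)$$
$$= \big[\theta\lambda - (d-1)\big] f(y) - \theta f(x) = \theta\,(T_\theta f)(x,y)\,,$$

where the last equality used (3.3) to replace $\theta\lambda$ by $\theta^2 + d - 1$; thus, $T_\theta f$ is
an eigenfunction of B corresponding to $\theta$ as long as $T_\theta f \ne 0$, and clearly
$T_\theta f = 0$ only if $\theta = \pm 1$ (which, in turn, occurs iff $\lambda = \pm d$).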
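
The identity $BT_\theta f = \theta\, T_\theta f$ can be verified on a small example. The sketch below applies the nonbacktracking operator of (3.1) on the complete graph $K_4$ (3-regular), using the adjacency eigenvector f = (1, -1, 0, 0) with eigenvalue $\lambda = -1$; the graph, the eigenvector, and the function names are illustrative choices.

```python
import cmath

def complete_graph(n):
    """Adjacency lists of K_n, which is (n-1)-regular."""
    return {v: [w for w in range(n) if w != v] for v in range(n)}

def apply_B(w, adj):
    """(Bw)(u,v) = sum of w(v,y) over neighbors y of v with y != u, as in (3.1)."""
    return {(u, v): sum(w[(v, y)] for y in adj[v] if y != u) for (u, v) in w}

def T(theta, f, adj):
    """(T_theta f)(x,y) = theta*f(y) - f(x) on directed edges, as in (3.4)."""
    return {(x, y): theta * f[y] - f[x] for x in adj for y in adj[x]}

def eigenvector_residual(adj, f, lam):
    """Return theta and max |B T_theta f - theta T_theta f| over directed edges."""
    d = len(next(iter(adj.values())))
    theta = (lam + cmath.sqrt(lam * lam - 4 * (d - 1))) / 2  # a root of (3.3)
    w = T(theta, f, adj)
    Bw = apply_B(w, adj)
    return theta, max(abs(Bw[e] - theta * w[e]) for e in w)

theta, residual = eigenvector_residual(complete_graph(4), [1, -1, 0, 0], -1)
print(theta, abs(theta), residual)
```

Here $\theta = (-1 + i\sqrt{7})/2$ has modulus $\sqrt{2} = \sqrt{d-1}$, as in case (iii) of Remark 3.3, and the residual vanishes up to floating-point error.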

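Proof of Proposition 3.1. Observe that $\ell^2(\vec{E}) = \ell^2_+(\vec{E}) \oplus \ell^2_-(\vec{E})$ where

$$\ell^2_+(\vec{E}) = \{w : w(x,y) = w(y,x)\}\,, \qquad \ell^2_-(\vec{E}) = \{w : w(x,y) = -w(y,x)\}\,,$$

as the term for (x, y) in $\langle w_+, w_- \rangle$ cancels with that of (y, x) if $w_\pm \in \ell^2_\pm(\vec{E})$.

With this in mind, the eigenspaces of 1 and -1 in B are straightforward:
the star spaces $S_- \subset \ell^2_-(\vec{E})$ and $S_+ \subset \ell^2_+(\vec{E})$ are defined by

$$S_\pm := \operatorname{span}\{s_x^\pm : x \in V\}\,, \qquad s_x^\pm(u,v) := \begin{cases} 1 & u = x\,, \\ \pm 1 & v = x\,, \\ 0 & \text{otherwise.}\end{cases}$$

For every $w \in \ell^2_-(\vec{E})$ and $s_x^-$ as above, $\langle w, s_x^- \rangle = 2\sum_y w(x,y)$, whence
$(Bw)(x,y) = -w(y,x) = w(x,y)$ when in addition $w \perp S_-$. Thus,

$$Bw = w \quad\text{for every } w \in \ell^2_-(\vec{E}) \cap S_-^\perp\,, \tag{3.5}$$

and similarly,

$$Bw = -w \quad\text{for every } w \in \ell^2_+(\vec{E}) \cap S_+^\perp\,. \tag{3.6}$$

As for the dimension of these spaces, note that if $\{a_x\}_{x \in V}$ is such that
$\sum_x a_x s_x^- = 0$ then $a_x = a_y$ for every $xy \in E$; since G is connected, this implies
that $\dim(S_-) = n - 1$, thus B has an orthonormal system of N/2 - (n - 1)
eigenvectors with eigenvalue 1. Similarly, if $\sum_x a_x s_x^+ = 0$ then $a_x = -a_y$ for
every $xy \in E$, so the eigenspace of -1 has dimension N/2 - (n - 1) if G is
bipartite and dimension N/2 - n otherwise.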
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Having specified these eigenvectors of B as well as those corresponding to 
6i, in Remark 3.4, we proceed to analyzing their inner products. Observe 
that after appropriate permutations of its rows and columns, B becomes 
block diagonal with blocks Jd—Id, where Jd and Id are the all-one matrix and 
identity matrix of order d, respectively; thus, B has an inverse, which under 
the same permutations is block diagonal with blocks {d— 1)~^ Jd — Id, so the 
matrix C := {d—l)B~^+B (which, of course, satisfies Cw = {{d—l)/6+9)w 
for every eigenfunction w oi B with eigenvalue 9) is given by 

{ 1 {v = X, u ^ y} or {y = u, V x} , 

(d 2 ) (u, v) (y, x), 

0 otherwise. 

Thus, C is real symmetric, and I^(B) and £^(B) are invariant under it. 
Furthermore, if / G £^(V) and Wf G £‘^{E) is given by Wf{x, y) := f{y) then 

{Cwf){x,y)= f{z)+ Y f{x) - {d-2)f{x) 

z:yzGE v:vx£E 

z^x V^y 

= Y /(^)+ /(*) = (^/)(y)> 

ziyzGE 

z^x 

and similarly, if w'j,{x,y) := f{x) then {Cwj){x,y) = {Af){x). Moreover, 
{wf,Wg) = {w'f,w'g) = d{f,g) and {wf,w'g) = {f,Ag) for f,g e £‘^{V). 

In particular, the eigenfunctions (/i)f=i correspond in this way to pairwise 
orthogonal eigenspaces of C with eigenvalues the dimension of each 

eigenspace is 1 if A* = ±d and 2 otherwise (as before, Wf can be a multiple of 
w'j, only if in = c or when G is bipartite and w = c on one part and w = —c 
on the other), and they notably include the eigenfunctions To-fi of B. 

Of course, every such 2-dimensional eigenspace corresponding to A* 7 ^ ±d 
is orthogonal to the eigenvectors of B from (3.5)-(3.6) (corresponding to the 
eigenvalues ±1), as those are also eigenvectors of id for the self-adjoint C. 
Finally, the eigenvector w = 1 with the eigenvalue d—1 of R (and eigenvalue 
d of C) is orthogonal to £‘^{E) (thus to all eigenvectors from (3.5)), whereas 
if $G$ is bipartite and we take $w = 1$ on outgoing edges from a prescribed
part of $G$ and $w = -1$ on the incoming ones (with eigenvalue $-(d-1)$ of
B) then w T £‘^{E), thus it is orthogonal to all eigenvectors from (3.6). 

Suppose for now that $A$ has no eigenvalue $\lambda_i$ such that $|\lambda_i| = 2\sqrt{d-1}$.
Then there are two distinct solutions to (3.3) for each of the $\lambda_i$'s, and so, in
particular, the eigenspace of $C$ corresponding to $\lambda_i \neq \pm d$ has two linearly
independent eigenvectors of $B$, corresponding to eigenvalues $\theta_i$ and $\theta_i'$. The
orthogonality of the eigenspaces from the discussion above now establishes
the form of $\Lambda$ from (3.2).

When there exist eigenvalues of $A$ such that $|\lambda_i| = 2\sqrt{d-1}$, we have the
unique solution $\theta_i = \lambda_i/2$ for (3.3), and claim that this gives rise to a Jordan
block $\begin{pmatrix} \theta_i & \alpha_i \\ 0 & \theta_i \end{pmatrix}$. Indeed, recalling that $B T_{\theta_i} f_i = \theta_i T_{\theta_i} f_i$, observe that

\begin{align*}
(B T_{1+\theta_i} f_i)(x,y) &= [(1+\theta_i)\lambda_i - (d-1)] f_i(y) - (1+\theta_i) f_i(x) \\
&= \theta_i [(1+\theta_i) f_i(y) - f_i(x)] + [\theta_i^2 + \theta_i - (d-1)] f_i(y) - f_i(x) \\
&= \theta_i (T_{1+\theta_i} f_i)(x,y) + (T_{\theta_i} f_i)(x,y), \tag{3.7}
\end{align*}

where the second equality used $\theta_i = \lambda_i/2$ and the last one used $\theta_i^2 = d-1$. As
these both belong to the corresponding eigenspace of $C$, we arrive at (3.2).
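For a concrete feel of the two regimes, the sketch below solves the quadratic behind (3.3), which is taken here to be $\theta^2 - \lambda\theta + (d-1) = 0$ (an assumption consistent with the later relations $\theta\theta' = d-1$ and $\theta + \theta' = \lambda$). The function name is illustrative.

```python
import cmath
import math


def nb_eigenvalues(lam, d):
    """Roots theta, theta' of theta^2 - lam*theta + (d - 1) = 0 (assumed form of (3.3))."""
    disc = cmath.sqrt(lam * lam - 4 * (d - 1))
    return (lam + disc) / 2, (lam - disc) / 2


# Below the threshold 2*sqrt(d-1): complex conjugate roots of modulus sqrt(d-1).
theta, theta_p = nb_eigenvalues(2, 3)
print(theta, theta_p, abs(theta), math.sqrt(2))

# Trivial eigenvalue lam = d: real roots d-1 and 1.
print(nb_eigenvalues(3, 3))

# At the threshold lam = 2*sqrt(d-1): a double root lam/2 (the Jordan block case).
print(nb_eigenvalues(2 * math.sqrt(2), 3))
```

At the threshold the discriminant vanishes only up to floating-point rounding, so the two returned roots may differ by a tiny amount.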

To conclude the proof, it remains to show that $|\alpha_i| < 2(d-1)$ if $\lambda_i \neq \pm d$.
Recall that there exist unit vectors $w_i, w_i'$ such that $Bw_i' = \alpha_i w_i + \theta_i' w_i'$ (these
can be taken as columns $2i$ and $2i+1$ of $U$ as above). Hence,

\[
(B - \theta_i' I) w_i' = \alpha_i w_i. \tag{3.8}
\]

Let $\|\cdot\|_{2\to 2}$ be the $\ell^2(E) \to \ell^2(E)$ operator norm; we claim $\|B\|_{2\to 2} = d-1$.
Indeed, it is easy to verify that

\[
(BB^*)_{(x,y),(u,v)} =
\begin{cases}
d-1 & x = u \text{ and } y = v, \\
d-2 & x \neq u \text{ and } y = v, \\
0 & \text{otherwise.}
\end{cases}
\]

We see that $BB^*$ has $\|BB^*\|_{\infty\to\infty} = (d-1)^2$ and an eigenvalue $(d-1)^2$
corresponding to the eigenvector $w \equiv 1$; thus, $\|B\|_{2\to 2} = d-1$. By (3.8),
using $|\theta_i'| < d-1$ and $\|w_i\| = \|w_i'\| = 1$, we infer that $|\alpha_i| < 2(d-1)$,
concluding the proof of the proposition. ■
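The entries of $BB^*$ and its row sums can be checked directly on an example. This is a minimal sketch under the assumed convention $B_{(u,v),(x,y)} = 1$ iff $v = x$ and $y \neq u$; the Petersen graph (3-regular, adjacency eigenvalues $3, 1, -2$, hence Ramanujan) and all function names are illustrative choices, not taken from the paper.

```python
def petersen():
    """Adjacency sets of the Petersen graph on vertices 0..9."""
    adj = {v: set() for v in range(10)}
    for i in range(5):
        # outer cycle, spoke, inner pentagram
        for a, b in ((i, (i + 1) % 5), (i, i + 5), (i + 5, (i + 2) % 5 + 5)):
            adj[a].add(b)
            adj[b].add(a)
    return adj


def directed_edges(adj):
    return [(x, y) for x in sorted(adj) for y in sorted(adj[x])]


def nonbacktracking_matrix(adj):
    """Returns the edge list E and B with B[(u,v),(x,y)] = 1 iff v == x, y != u."""
    E = directed_edges(adj)
    B = [[1 if (v == x and y != u) else 0 for (x, y) in E] for (u, v) in E]
    return E, B


def check_bbt(adj, d):
    """True iff B B^T has the displayed entries and every row sums to (d-1)^2."""
    E, B = nonbacktracking_matrix(adj)
    N = len(E)
    for i, (x, y) in enumerate(E):
        row_sum = 0
        for j, (u, v) in enumerate(E):
            entry = sum(B[i][k] * B[j][k] for k in range(N))
            if (x, y) == (u, v):
                expected = d - 1
            elif y == v:
                expected = d - 2
            else:
                expected = 0
            if entry != expected:
                return False
            row_sum += entry
        if row_sum != (d - 1) ** 2:
            return False
    return True


print(check_bbt(petersen(), 3))
```

Constant row sums $(d-1)^2$ of a nonnegative symmetric matrix give both the $\infty\to\infty$ norm and the eigenvalue for the all-ones vector, which is the step used to conclude $\|B\|_{2\to 2} = d-1$.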

3.2. Cutoff on non-bipartite Ramanujan graphs. On every $d$-regular
graph on $n$ vertices, the number of directed edges at distance $\ell$ from a given
$(x,y) \in E$ is at most $(d-1)^\ell$; this readily implies (as stated in [22, Claim 4.8])
that the nonbacktracking random walk satisfies

\[
t_{\mathrm{mix}}(1-\varepsilon) \geq \lceil \log_{d-1}(dn) \rceil - \lceil \log_{d-1}(1/\varepsilon) \rceil \quad \text{for any } 0 < \varepsilon < 1. \tag{3.9}
\]

Our goal in this section is to show an asymptotically tight upper bound on
$t_{\mathrm{mix}}$ using the spectral decomposition of the nonbacktracking operator $B$.
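The counting argument behind (3.9) can be illustrated numerically. The sketch below assumes that the right-hand side of (3.9) reads $\lceil \log_{d-1}(dn) \rceil - \lceil \log_{d-1}(1/\varepsilon) \rceil$ (the argument $dn$ is a reading of the garbled source, motivated by the $N = dn$ directed edges), and uses the fact that after $t$ steps the NBRW is supported on at most $(d-1)^t$ of them. Function names are hypothetical.

```python
import math


def nbrw_mixing_lower_bound(n, d, eps):
    """Right-hand side of (3.9) under the assumed reading described above."""
    return math.ceil(math.log(d * n, d - 1)) - math.ceil(math.log(1 / eps, d - 1))


def tv_support_lower_bound(n, d, t):
    """TV distance to uniform is at least 1 - (d-1)^t / (d n) at time t."""
    return max(0.0, 1 - (d - 1) ** t / (d * n))


n, d, eps = 1000, 3, 0.3
bound = nbrw_mixing_lower_bound(n, d, eps)
print(bound)
# One step before the bound, the support argument still forces distance > 1 - eps.
print(tv_support_lower_bound(n, d, bound - 1), 1 - eps)
```

Floating-point logarithms can be off by one at exact powers of $d-1$; the parameters above avoid that boundary case.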


Theorem 3.5. Let $G$ be a non-bipartite Ramanujan graph on $n$ vertices
with degree $d \geq 3$. Let $\mu_t$ be the $t$-step transition kernel of the NBRW, and
let $\pi$ be the uniform distribution on $E$. Then for some fixed $c(d) > 0$,

\[
\max_{(x,y) \in E} \left\| \frac{\mu_t((x,y),\cdot)}{\pi} - 1 \right\|_{L^2(\pi)}^2 \leq \frac{c(d)}{\log n}
\quad \text{at } t = \lceil \log_{d-1} n + 3 \log_{d-1} \log n \rceil .
\]

Consequently, on any sequence of such graphs, the NBRW exhibits $L^1$-cutoff
and $L^2$-cutoff both at time $\log_{d-1} n$.

Remark 3.6. The constant $c(d)$ in the above theorem can be taken to be
$8d(d-1)\log^{-2}(d-1) + 1$ for any sufficiently large $n$ (cf. (3.13) below).


Proof of Theorem 3.5. Appealing to Proposition 3.1, let $U$ be the unitary
matrix such that $B = U \Lambda U^*$ with $\Lambda$ from (3.2), and write

\[
U = \left( w_1 \mid w_2 \mid w_2' \mid w_3 \mid w_3' \mid \ldots \mid w_n \mid w_n' \mid u_1 \mid \ldots \mid u_{N-(2n-1)} \right),
\]

in which $w_1 \equiv N^{-1/2}$. Recalling Remark 3.3, observe that the assumption
that $G$ is non-bipartite Ramanujan implies that for all $i = 2, \ldots, n$, the
solutions $\theta_i, \theta_i'$ to (3.3) satisfy $\theta_i' = \bar{\theta}_i$ and $|\theta_i| = \sqrt{d-1}$.

Let $(x_0, y_0) \in E$ be some initial edge for the NBRW; by the expansion (2.6)
of the $L^2$-distance, the $t$-step transition kernel $\mu_t = (d-1)^{-t} B^t$ satisfies

\[
\left\| \frac{\mu_t((x_0,y_0),\cdot)}{\pi} - 1 \right\|_{L^2(\pi)}^2
= N \sum_{(x,y)} \left| \mu_t((x_0,y_0),(x,y)) \right|^2 - 1
= N \left\| \mu_t((x_0,y_0),\cdot) \right\|_2^2 - 1. \tag{3.10}
\]


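Using $B = U \Lambda U^*$ with $\Lambda$ from (3.2) and $U$ as specified above we find that

\begin{align*}
B^t((x_0,y_0),\cdot) ={}& (d-1)^t w_1(x_0,y_0)\, \overline{w_1} + \sum_i (\pm 1)^t u_i(x_0,y_0)\, \overline{u_i} \\
&+ \sum_{i=2}^{n} \Big[ \theta_i^t w_i(x_0,y_0)\, \overline{w_i} + \big( \theta_i'^{\,t} w_i'(x_0,y_0) + \gamma_i(t) w_i(x_0,y_0) \big) \overline{w_i'} \Big],
\end{align*}

where

\[
\gamma_i(t) := \alpha_i \sum_{j=0}^{t-1} \theta_i^{j}\, \theta_i'^{\,t-j-1}
\]

with $\alpha_i$ from Proposition 3.1. Note that in particular, as $|\alpha_i| < 2(d-1)$,

\[
|\gamma_i(t)| \leq 2(d-1)\, t\, |\theta_i|^{t-1}. \tag{3.11}
\]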
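The term $\gamma_i(t)$ is the off-diagonal entry of the $t$-th power of a $2 \times 2$ upper-triangular block of $\Lambda$. The following sketch (illustrative names, sample values $d = 3$, $\lambda = 2$, $\alpha = d - 2$ chosen for the example) compares the sum defining $\gamma_i(t)$ with a direct matrix power and with the bound (3.11).

```python
def gamma(alpha, theta, theta_p, t):
    """alpha * sum_{j=0}^{t-1} theta^j * theta_p^(t-1-j)."""
    return alpha * sum(theta ** j * theta_p ** (t - 1 - j) for j in range(t))


def block_power(alpha, theta, theta_p, t):
    """Entries (a, b, c) of [[theta, alpha], [0, theta_p]]^t = [[a, b], [0, c]]."""
    a, b, c = 1, 0, 1  # start from the identity matrix
    for _ in range(t):
        # right-multiply the current power by the block
        a, b, c = a * theta, a * alpha + b * theta_p, c * theta_p
    return a, b, c


d, t = 3, 5
theta, theta_p, alpha = 1 + 1j, 1 - 1j, d - 2   # roots for lambda = 2, d = 3
g = gamma(alpha, theta, theta_p, t)
_, b, _ = block_power(alpha, theta, theta_p, t)
print(g, b)
print(abs(g), 2 * (d - 1) * t * abs(theta) ** (t - 1))  # left side vs. bound (3.11)
```

For distinct roots the sum telescopes to $\alpha(\theta^t - \theta'^t)/(\theta - \theta')$, which is the form used later in Remark 3.7.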

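From the above expansion of $B^t$, since $U$ is unitary and $w_1 \equiv N^{-1/2}$,

\begin{align*}
\left\| \mu_t((x_0,y_0),\cdot) \right\|^2 ={}& \frac{1}{N} + \sum_i (d-1)^{-2t} |u_i(x_0,y_0)|^2 \\
&+ (d-1)^{-2t} \sum_{i=2}^{n} \Big( |\theta_i|^{2t} |w_i(x_0,y_0)|^2 + \big| \theta_i'^{\,t} w_i'(x_0,y_0) + \gamma_i(t) w_i(x_0,y_0) \big|^2 \Big). \tag{3.12}
\end{align*}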


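Now we exploit the fact that $G$ is Ramanujan: since $|\theta_i| = \sqrt{d-1}$ for every
$2 \leq i \leq n$, the expression in the second line of (3.12) is at most

\[
(d-1)^{-t} \sum_{i=2}^{n} \left( |w_i(x_0,y_0)|^2 + 2 |w_i'(x_0,y_0)|^2 + \frac{2 |\gamma_i(t)|^2}{(d-1)^t} |w_i(x_0,y_0)|^2 \right),
\]

using the parallelogram law. Since by Parseval's identity (the rows of the
unitary matrix $U$ have unit norm),

\[
\sum_i |u_i(x_0,y_0)|^2 + \sum_{i=2}^{n} \left( |w_i(x_0,y_0)|^2 + |w_i'(x_0,y_0)|^2 \right) \leq 1,
\]

and with (3.10) in mind, we infer that

\[
\left\| \frac{\mu_t((x_0,y_0),\cdot)}{\pi} - 1 \right\|_{L^2(\pi)}^2
\leq 2N (d-1)^{-t} \left[ 1 + \max_i \frac{|\gamma_i(t)|^2}{(d-1)^t} \right].
\]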


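Substituting the bound (3.11) on $\gamma_i(t)$, again using that $G$ is Ramanujan,

\[
\left\| \frac{\mu_t((x_0,y_0),\cdot)}{\pi} - 1 \right\|_{L^2(\pi)}^2
\leq 2N (d-1)^{-t} \left( 4(d-1) t^2 + 1 \right). \tag{3.13}
\]

In particular, for $t = \lceil \log_{d-1} n + 3 \log_{d-1} \log n \rceil$,

\[
\left\| \frac{\mu_t((x_0,y_0),\cdot)}{\pi} - 1 \right\|_{L^2(\pi)}^2 \leq O(1/\log n) = o(1),
\]

thus concluding the proof of Theorem 3.5. ■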
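To see how the bound behaves at finite $n$, the sketch below evaluates the right-hand side of (3.13), read as $2N(d-1)^{-t}(4(d-1)t^2+1)$ with $N = dn$, at the time $t = \lceil \log_{d-1} n + 3\log_{d-1}\log n \rceil$. The function names are illustrative, and the numbers only illustrate the $O(1/\log n)$ decay; they are not sharp constants.

```python
import math


def theorem_time(n, d):
    """t = ceil(log_{d-1} n + 3 log_{d-1} log n)."""
    return math.ceil(math.log(n, d - 1) + 3 * math.log(math.log(n), d - 1))


def l2_bound_squared(n, d, t):
    """Right-hand side of (3.13) with N = d * n directed edges."""
    N = d * n
    return 2 * N * (d - 1) ** (-t) * (4 * (d - 1) * t * t + 1)


for n in (10 ** 6, 10 ** 12, 10 ** 24):
    t = theorem_time(n, 3)
    bound = l2_bound_squared(n, 3, t)
    print(n, t, bound, bound * math.log(n))
```

The product `bound * math.log(n)` approaches its limiting constant slowly, because $t / \log_{d-1} n \to 1$ only at rate $\log\log n / \log n$; this is why the bound is stated for sufficiently large $n$.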


Using the reduction in §2.1 from SRW to NBRW (see (2.3)-(2.4)), one
can deduce Theorem 1 from Theorem 3.5, as the $O(\log \log n)$ window for
the NBRW is negligible compared with the term $s\sqrt{\log_{d-1} n}$ in (1.1).

Note that for every integer $\ell \geq (2 + o(1)) \log_{d-1} n$ there is a path of length
exactly $\ell$ between every pair of vertices $x, y$ using $D_\infty(2t) \leq D_2(t) D_2(t)$ for
the NBRW (recall (2.15), and that the chain and its reversal are isomorphic).

Proof of Corollary 2. Since $\max_{(x,y)} \| \mu_t((x,y),\cdot) - \pi \|_{\mathrm{TV}} = o(1)$ at time $t$
as per Theorem 3.5, for every x, all but o(n) directed edges can be reached 
by a nonbacktracking path of length t from x. The remark above (3.9) 
on the growth of balls in a d-regular graph thus implies the corollary: the 
statement on a nonbacktracking cycle follows from applying this argument 
once on a directed edge originating from x (and reaching almost every y 
within the proper length bound) and once on an arbitrarily chosen other 
directed edge ending at x, in the reversed NBRW. ■ 

Proof of Corollary 3. Note that at time R, the L^-distance of the NBRW 
from equilibrium is 0(l/log^/^n) by (3.13), and that k = O(logn) since 
k < g. For a uniformly chosen path {yi)^^i in G, each yi is uniform by 
the stationarity of the NBRW. Thus, by a union bound over the vertices yi, 
for each i there exists a path of length R from the edge {xi,Zi) to {yi,z'j), 
except with probability 0(A:/log^^^ n) = o(l), where Zi and z' are not on 
the paths $(x_i)$ and $(y_i)$, respectively. The conclusion now follows since, if
vertex $\ell$ of the path from $x_i$ coincides with vertex $\ell'$ of the path from $x_j$, then
$\ell + \ell' + k \geq g$ and $(R - \ell) + (R - \ell') + k \geq g$, so $k \geq g - R$, a contradiction. ■

Remark 3.7. In the setting of Theorem 3.5, if $G$ is in addition transitive
then, by using the exact value $|\alpha_i| = d - 2$ from Proposition 4.1 below, the
$L^2$-mixing time of the NBRW can be pinpointed precisely: let


Tcik) := {d - 2f{d - l)-i / Uk-l{xfd^lG 

for empirical spectral distribution (ESD) of G 

and Uk{cos{x)) = Chebyshev polynomial of the second kind. 

Then for any fixed e > 0, 

imx (e) = [logrf_i(n) -k logrf_i (TG(logrf_i n) + 2) + logrf_i(l/e)] . (3.14) 

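Indeed, from (3.12) we see that for any non-bipartite Ramanujan graph $G$
(not necessarily transitive), averaging over the initial state $(x_0, y_0)$ gives

\[
\frac{1}{N} \sum_{(x_0,y_0)} \left\| \mu_t((x_0,y_0),\cdot) \right\|^2
= \frac{1}{N} + \frac{N - 2n + 1}{N (d-1)^{2t}} + \frac{2n - 2}{N (d-1)^{t}} + \frac{1}{N} \sum_{i=2}^{n} \frac{|\gamma_i(t)|^2}{(d-1)^{2t}},
\]

using that $w_i \perp w_i'$ and $\|u_i\| = \|w_i\| = \|w_i'\| = 1$ for all $i$. Thus, by (3.10),

\[
\frac{1}{N} \sum_{(x_0,y_0)} \left\| \frac{\mu_t((x_0,y_0),\cdot)}{\pi} - 1 \right\|_{L^2(\pi)}^2
= (1 + o(1)) \left( \frac{1}{n} \sum_{i=2}^{n} \frac{|\gamma_i(t)|^2}{(d-1)^{t}} + 2 \right) \frac{n}{(d-1)^{t}}
\]

provided that $t \to \infty$ with $n$. Writing $\cos \varphi_i = \lambda_i / (2\sqrt{d-1})$ (so $\theta_i = \sqrt{d-1}\, e^{\mathrm{i} \varphi_i}$ for
$i = 2, \ldots, n$) and using Proposition 4.1,

\[
|\gamma_i(t)| = (d-2) \left| \frac{\theta_i^t - \bar{\theta}_i^t}{\theta_i - \bar{\theta}_i} \right|
= (d-2) (d-1)^{(t-1)/2} \left| \frac{\sin(t \varphi_i)}{\sin \varphi_i} \right| ,
\]

which implies the analogue of (3.14) for the average of the mixing times over
the initial states $(x_0, y_0)$, thus establishing (3.14) for the transitive case.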

3.3. Extensions. We conclude with corollaries of the proof of Theorem 3.5. 


3.3.1. Bipartite Ramanujan graphs. Following is the analog for NBRW in the
bipartite case; its SRW counterpart follows from the cover-tree reduction.


Corollary 3.8. Let $G = (V_0, V_1, E)$ be a bipartite Ramanujan graph on $n$
vertices with degree $d \geq 3$. Let $\mu_t$ be the $t$-step transition kernel of the
NBRW, and let $\pi_0$ and $\pi_1$ be the uniform distribution on the $N/2$ directed
edges originating from $V_0$ and $V_1$, respectively. Then for some fixed $c(d) > 0$,

\[
\max_{\substack{(x_0,y_0) \in E \\ x_0 \in V_0}} \left\| \frac{\mu_t((x_0,y_0),\cdot)}{\pi_{(t \bmod 2)}} - 1 \right\|_{L^2(\pi_{(t \bmod 2)})}^2 \leq \frac{c(d)}{\log n}
\]

at time

\[
t = \lceil \log_{d-1} n + 3 \log_{d-1} \log n \rceil .
\]

Consequently, on any sequence of such graphs, the NBRW that is modified
to be lazy in its first step exhibits $L^1$-cutoff and $L^2$-cutoff at time $\log_{d-1} n$.

Proof. Following the arguments used to prove Theorem 3.5, observe that in
computing $\mathbb{E}\big[ |\mu_t((x_0,y_0),(x,y)) / \pi_{(t \bmod 2)} - 1|^2 \big]$, the identity (3.10) becomes
valid once we replace $N$ by $N/2$. The only other modification needed is to
treat $\lambda_n = -d$, which produces the eigenvalue $\theta_n = -(d-1)$. Since all the
coordinates of $w_n$ are $\pm N^{-1/2}$, the contribution of this eigenvalue to the
right-hand side of (3.12) is $1/N$, exactly that of the eigenvalue $d-1$ of $B$. The
combined $2/N$ cancels via the modified identity (3.10), thus (3.13) becomes

\[
\left\| \frac{\mu_t((x_0,y_0),\cdot)}{\pi_{(t \bmod 2)}} - 1 \right\|_{L^2(\pi_{(t \bmod 2)})}^2
\leq N (d-1)^{-t} \left( 4(d-1) t^2 + 1 \right),
\]

which is $O(1/\log n)$ at the same value of $t$.


Corollary 3.9. Let $G = (V_0, V_1, E)$ be a bipartite Ramanujan graph on $n$
vertices with degree $d \geq 3$. Let $P^t$ be the $t$-step transition kernel of the SRW,
and let $\pi_0$ and $\pi_1$ be the uniform distribution on $V_0$ and $V_1$, respectively.
Then for every fixed $s \in \mathbb{R}$ and every initial vertex $x$, the SRW at time

\[
t = \frac{d}{d-2} \log_{d-1} n + s \sqrt{\log_{d-1} n}
\]

satisfies

\[
\max_{x_0 \in V_0} \left\| P^t(x_0, \cdot) - \pi_{(t \bmod 2)} \right\|_{\mathrm{TV}} \to \mathbb{P}(Z > c_d\, s) \quad \text{as } n \to \infty ,
\]

where $Z$ is a standard normal random variable and $c_d = \dfrac{(d-2)^{3/2}}{2\sqrt{d(d-1)}}$.

Consequently, on any sequence of such graphs, the SRW that is modified to 
be lazy in its first step exhibits L^-cutoff and L^-cutoff at time log(i_i n. 
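The limiting profile in Corollary 3.9 is easy to tabulate. The sketch below assumes the constant reads $c_d = (d-2)^{3/2} / (2\sqrt{d(d-1)})$ (a reconstruction consistent with a walk of speed $(d-2)/d$ and per-step variance $4(d-1)/d^2$ on the $d$-regular tree) and evaluates $\mathbb{P}(Z > c_d s)$ through the complementary error function. Function names are hypothetical.

```python
import math


def c_d(d):
    """Assumed constant c_d = (d-2)^{3/2} / (2 sqrt(d (d-1)))."""
    return (d - 2) ** 1.5 / (2 * math.sqrt(d * (d - 1)))


def tv_profile(d, s):
    """P(Z > c_d * s) for a standard normal Z."""
    return 0.5 * math.erfc(c_d(d) * s / math.sqrt(2))


for s in (-4, -2, 0, 2, 4):
    print(s, round(tv_profile(3, s), 4))
```

The profile equals $1/2$ at $s = 0$ and decreases monotonically from 1 to 0 as $s$ increases, which is the shape of a cutoff window of order $\sqrt{\log n}$.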


3.3.2. Weakly Ramanujan graphs. It suffices to establish the result for the
NBRW (here we do not specify $D_{\mathrm{TV}}$ for the SRW within the cutoff window,
thus there is no need to control the NBRW within a window of $o(\sqrt{\log n})$),
and Theorem 4 and Corollary 5 will then follow using the above reduction.


Corollary 3.10. Fix $d \geq 3$ and let $G$ be a $d$-regular graph on $n$ vertices
whose nontrivial eigenvalues $\{\lambda_i\}_{i=2}^{n}$ satisfy $|\lambda_i| \leq (1 + \delta_n) 2\sqrt{d-1}$ for
some $\delta_n$ going to 0 as $n \to \infty$. Let $\mu_t$ be the $t$-step transition kernel of the
NBRW, and let $\pi$ be the uniform distribution on $E$. For some fixed $c(d) > 0$,

\[
\max_{(x,y) \in E} \left\| \frac{\mu_t((x,y),\cdot)}{\pi} - 1 \right\|_{L^2(\pi)}^2 \leq \frac{c(d)}{\log n}
\]

at time

\[
t = \left( 1 + 5\sqrt{\delta_n} \right) \log_{d-1} n + 3 \log_{d-1} \log n .
\]

Consequently, on any sequence of such graphs, the NBRW exhibits $L^1$-cutoff
and $L^2$-cutoff both at time $\log_{d-1} n$.






















Proof. The analysis of blocks of $\Lambda$ corresponding to eigenvalues $\lambda_i$ ($i \geq 2$)
of $A$ such that $|\lambda_i| \leq 2\sqrt{d-1}$ remains valid unchanged, and it remains to
consider the effect of

\[
|\lambda_i| = (1 + \varepsilon) 2\sqrt{d-1} \quad \text{for some } 0 < \varepsilon \leq \delta_n . \tag{3.15}
\]

As mentioned in the proof of Theorem 3.5, the fact that $G$ is Ramanujan
is exploited when replacing $|\theta_i|^{2t}$ by $(d-1)^t$ for all $i \geq 2$ in the spectral
decomposition (3.12), and once again (just above (3.13)) in the bound (3.11)
on $\gamma_i(t)$. For $\lambda_i$ as in (3.15), the corresponding real eigenvalues $\theta_i, \theta_i'$ of $B$
are given, as per (3.3), by

\[
\left( 1 + \varepsilon \pm \sqrt{\varepsilon (2 + \varepsilon)} \right) \sqrt{d-1} ;
\]

in particular, denoting $|\theta_i| > |\theta_i'|$, we have $|\theta_i| = (1 + \sqrt{2\varepsilon} + O(\varepsilon)) \sqrt{d-1}$
(while at the same time $|\theta_i'| < \sqrt{d-1}$). We account for this modified value
of $|\theta_i|$ in the spectral decomposition of $\| \mu_t((x,y),\cdot) \|^2$ via the pre-factor

\[
\left( 1 + \sqrt{2\varepsilon} + O(\varepsilon) \right)^{2t} \leq \exp\left[ \left( 2\sqrt{2\delta_n} + O(\delta_n) \right) t \right],
\]

thus replacing the right-hand side of (3.13) by

\[
2N (d-1)^{-t} e^{[2\sqrt{2\delta_n} + O(\delta_n)] t} \left( 4(d-1) t^2 + 1 \right).
\]

For the designated value of $t$ (in which there is an extra additive term of
$5\sqrt{\delta_n} \log_{d-1} n$ compared to $t$ from Theorem 3.5) and using that $\delta_n \to 0$, we
find that there exists some fixed $c(d) > 0$ such that $\| \mu_t((x,y),\cdot)/\pi - 1 \|_{L^2(\pi)}^2$
is at most

\[
\frac{c(d) + o(1)}{\log n} \exp\left[ \left( 2\sqrt{2} - 5 \log(d-1) + o(1) \right) \sqrt{\delta_n}\, t \right],
\]

which is $O(1/\log n)$ since $2\sqrt{2} < 5 \log(d-1)$ for all $d \geq 3$. ■
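Two elementary facts used in this proof can be checked numerically: the size of the larger root of the quadratic $\theta^2 - \lambda\theta + (d-1) = 0$ (the assumed form of (3.3)) when $|\lambda| = (1+\varepsilon) 2\sqrt{d-1}$, and the sign of $2\sqrt{2} - 5\log(d-1)$. The following is a minimal sketch with illustrative names.

```python
import math


def weak_ramanujan_roots(eps, d):
    """Real roots of theta^2 - lam*theta + (d-1) = 0 for lam = (1+eps) * 2 sqrt(d-1)."""
    lam = (1 + eps) * 2 * math.sqrt(d - 1)
    disc = math.sqrt(lam * lam - 4 * (d - 1))
    return (lam + disc) / 2, (lam - disc) / 2


eps, d = 0.01, 3
theta, theta_p = weak_ramanujan_roots(eps, d)
# Larger root relative to sqrt(d-1), its closed form, and the first-order term.
print(theta / math.sqrt(d - 1), 1 + eps + math.sqrt(eps * (2 + eps)), 1 + math.sqrt(2 * eps))
# Smaller root stays below sqrt(d-1).
print(theta_p, math.sqrt(d - 1))
# The exponent 2 sqrt(2) - 5 log(d-1) is negative already for d = 3.
print(2 * math.sqrt(2) - 5 * math.log(d - 1))
```

The excess of the larger root over $\sqrt{d-1}$ is of order $\sqrt{\varepsilon}$ rather than $\varepsilon$, which is why the time in Corollary 3.10 carries a $\sqrt{\delta_n}$ correction.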


Remark 3.11. Suppose that, for some 6n = o(l) and fixed e' > 0, the graph 
G has |A| < 2^/d — 1 -|- 6n for all eigenvalues A except for exceptional 
ones, which instead satisfy |A| < d — e'. Each eigenvalue of the latter form 
corresponds to an additive term of 0{af^) in the right-hand of (3.13), where 
0 < a < 1 depends only on d and eX For the prescribed t from Corollary 3.10, 
this amounts to 0{n~^ ) for some fixed e" > 0, thus the overall contribution 
of these exceptional eigenvalues is negligible and the same result holds. 


4. Pinpointing the spectral decomposition 

The following proposition gives the precise moduli of the off-diagonal
terms in $\Lambda$ from the spectral decomposition (3.2) in Proposition 3.1.

Proposition 4.1. In the setting of Proposition 3.1, for all $i \geq 2$ we have
$\alpha_i = 0$ if $\lambda_i = -d$ (and $i = n$), and otherwise

\[
|\alpha_i| =
\begin{cases}
d - 2 & \text{if } |\lambda_i| \leq 2\sqrt{d-1}, \\
\sqrt{d^2 - \lambda_i^2} & \text{if } |\lambda_i| > 2\sqrt{d-1}.
\end{cases}
\]


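Proof. Let $i \geq 2$, and for simplicity, omit its indices from the corresponding
subscripts; namely, let $\theta, \theta'$ correspond to the eigenvalue $\lambda \neq \pm d$, and let $f$
be so that $Af = \lambda f$ and $\|f\| = 1$, where $A$ is the adjacency matrix of $G$.

Case (1): $|\lambda| \neq 2\sqrt{d-1}$: Recalling $T_\theta$ from (3.4), we claim that

\[
\alpha = \frac{\beta (\theta' - \theta)}{\sqrt{1 - |\beta|^2}} \quad \text{where} \quad
\beta := \frac{\langle T_{\theta'} f, T_\theta f \rangle}{\|T_{\theta'} f\| \, \|T_\theta f\|} . \tag{4.1}
\]

Indeed, taking

\[
w = \frac{T_\theta f}{\|T_\theta f\|} , \qquad
w' = \frac{T_{\theta'} f}{\|T_{\theta'} f\|} , \qquad
w'' = \frac{w' - \beta w}{\|w' - \beta w\|}
\]

for $\beta$ as above gives $Bw = \theta w$, $Bw' = \theta' w'$, and

\[
\|w' - \beta w\|^2 = 1 + |\beta|^2 - \bar{\beta} \langle w', w \rangle - \beta \langle w, w' \rangle = 1 - |\beta|^2 , \tag{4.2}
\]

so $w'' = (1 - |\beta|^2)^{-1/2} (w' - \beta w)$ satisfies

\[
Bw'' = \frac{\theta' w' - \beta \theta w}{\sqrt{1 - |\beta|^2}}
= \theta' w'' + \frac{\beta (\theta' - \theta)}{\sqrt{1 - |\beta|^2}}\, w
= \theta' w'' + \alpha w ,
\]

as claimed. To estimate $\alpha$, observe that for every $f \in \ell^2(V)$,

\[
\|T_\theta f\|_{\ell^2(E)}^2 = d (|\theta|^2 + 1) \|f\|_{\ell^2(V)}^2 - (\theta + \bar{\theta}) \langle Af, f \rangle_{\ell^2(V)} . \tag{4.3}
\]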


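Case (1.a): below the Ramanujan threshold. When $|\lambda| < 2\sqrt{d-1}$ we
have $\theta' = \bar{\theta} \in \mathbb{C} \setminus \mathbb{R}$. Since $\theta^2 + d - 1 = \lambda \theta$ and $|\theta| = \sqrt{d-1}$,

\begin{align*}
\|T_\theta f\|^2 &= d^2 - (\theta + \bar{\theta}) \lambda = d^2 - 2(d-1) - (\theta^2 + \bar{\theta}^2) \\
&= d^2 - 2(d-1) [1 + \cos(2\varphi)] = (d-2)^2 + 2(d-1)(1 - \cos(2\varphi)) ,
\end{align*}

where we let $\theta = \sqrt{d-1} \exp(\mathrm{i}\varphi)$. Similarly,

\[
\beta = \frac{d(\bar{\theta}^2 + 1) - 2\bar{\theta}\lambda}{(d-2)^2 + 2(d-1)(1 - \cos(2\varphi))}
= \frac{(d-2)(\bar{\theta}^2 - 1)}{(d-2)^2 + 2(d-1)(1 - \cos(2\varphi))} ,
\]

and so

\[
1 - |\beta|^2 = 1 - \frac{(d-1)^2 + 1 - 2(d-1)\cos(2\varphi)}{\left[ d - 2 + 2\frac{d-1}{d-2}(1 - \cos(2\varphi)) \right]^2}
= \frac{4 \left( \frac{d-1}{d-2} \right)^2 (1 - \cos(2\varphi))^2 + 2(d-1)(1 - \cos(2\varphi))}{\left[ d - 2 + 2\frac{d-1}{d-2}(1 - \cos(2\varphi)) \right]^2} .
\]

Substituting $\cos(2\varphi) = 1 - 2\sin^2 \varphi$ we see that

\[
1 - |\beta|^2 = \frac{4(d-1) \left( 1 + 4\frac{d-1}{(d-2)^2} \sin^2 \varphi \right) \sin^2 \varphi}{\left[ d - 2 + 4\frac{d-1}{d-2} \sin^2 \varphi \right]^2}
= \frac{4(d-1) \sin^2 \varphi}{(d-2)^2 + 4(d-1) \sin^2 \varphi} ,
\]

and so

\[
\frac{|\beta|^2}{1 - |\beta|^2} = \frac{1}{1 - |\beta|^2} - 1 = \frac{(d-2)^2}{4(d-1) \sin^2 \varphi} .
\]

Since $\theta - \theta' = 2\mathrm{i}\sqrt{d-1} \sin \varphi$, we conclude from (4.1) that $|\alpha| = d - 2$.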

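Case (1.b): above the Ramanujan threshold. For $2\sqrt{d-1} < |\lambda| < d$,
we have $\theta \neq \theta' \in \mathbb{R}$ and assume w.l.o.g. that $\theta > \theta'$. By (4.3) we get

\[
\|T_\theta f\|^2 = d(\theta^2 + 1) - 2\theta\lambda = d(\theta^2 + 1) - 2(\theta^2 + d - 1) = (d-2)(\theta^2 - 1) ,
\]

and for the same reason, $\|T_{\theta'} f\|^2 = (d-2)(\theta'^2 - 1)$. Similarly,

\[
\langle T_\theta f, T_{\theta'} f \rangle = d(\theta\theta' + 1) - (\theta + \theta')\lambda = d^2 - \lambda^2 ,
\]

using that $\theta\theta' = d - 1$ whereas $\theta + \theta' = \lambda$ through their definition in (3.3).
Since we also have $\theta^2 + \theta'^2 = \lambda^2 - 2(d-1)$, we see that

\[
(\theta^2 - 1)(\theta'^2 - 1) = (d-1)^2 - (\lambda^2 - 2(d-1)) + 1 = d^2 - \lambda^2 ,
\]

and altogether deduce that

\[
\beta = \frac{d^2 - \lambda^2}{\left[ (d-2)(\theta^2 - 1) \right]^{1/2} \left[ (d-2)(\theta'^2 - 1) \right]^{1/2}} = \frac{\sqrt{d^2 - \lambda^2}}{d - 2} .
\]

Therefore,

\[
\frac{\beta^2}{1 - \beta^2} = \frac{d^2 - \lambda^2}{(d-2)^2 - (d^2 - \lambda^2)} = \frac{d^2 - \lambda^2}{\lambda^2 - 4(d-1)} .
\]

Recalling the definition (4.1) of $\alpha$, and using that $\theta - \theta' = \sqrt{\lambda^2 - 4(d-1)}$,
we infer that $|\alpha|^2 = d^2 - \lambda^2$.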
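The conclusions of Cases (1.a) and (1.b) can be cross-checked numerically from (4.1) and (4.3) alone. The sketch below assumes (3.3) is $\theta^2 - \lambda\theta + (d-1) = 0$, takes $f$ to be a real unit eigenvector so that $\langle Af, f \rangle = \lambda$, and uses the polarized form $\langle T_a f, T_b f \rangle = d(a\bar{b} + 1) - (a + \bar{b})\lambda$ of (4.3). The function name and sample parameters are illustrative.

```python
import cmath
import math


def alpha_modulus(lam, d):
    """|alpha| from (4.1), valid when |lam| != 2 sqrt(d-1) and |lam| < d."""
    disc = cmath.sqrt(lam * lam - 4 * (d - 1))
    theta, theta_p = (lam + disc) / 2, (lam - disc) / 2

    def inner(a, b):
        # <T_a f, T_b f> for a real unit eigenvector f of A with eigenvalue lam
        return d * (a * b.conjugate() + 1) - (a + b.conjugate()) * lam

    norm_product = math.sqrt(inner(theta_p, theta_p).real * inner(theta, theta).real)
    beta = inner(theta_p, theta) / norm_product
    alpha = beta * (theta_p - theta) / math.sqrt(1 - abs(beta) ** 2)
    return abs(alpha)


# Case (1.a): d = 3, lam = 2 < 2 sqrt(2); expected modulus d - 2 = 1.
print(alpha_modulus(2, 3))
# Case (1.b): d = 4, lam = 3.8 > 2 sqrt(3); expected modulus sqrt(d^2 - lam^2).
print(alpha_modulus(3.8, 4), math.sqrt(16 - 3.8 ** 2))
```

The threshold case $|\lambda| = 2\sqrt{d-1}$ is excluded here, since the two eigenvectors coincide and the normalization by $\sqrt{1 - |\beta|^2}$ degenerates; it is covered by the separate Case (2).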

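Case (2): at the Ramanujan threshold: For $|\lambda| = 2\sqrt{d-1}$ we claim

\[
\alpha = \frac{\|T_\theta f\|}{\|T_{1+\theta} f\| \sqrt{1 - |\beta|^2}} \quad \text{where} \quad
\beta := \frac{\langle T_{1+\theta} f, T_\theta f \rangle}{\|T_{1+\theta} f\| \, \|T_\theta f\|} . \tag{4.4}
\]

To see this, take

\[
w = \frac{T_\theta f}{\|T_\theta f\|} , \qquad
w' = \frac{T_{1+\theta} f}{\|T_{1+\theta} f\|} , \qquad
w'' = \frac{w' - \beta w}{\|w' - \beta w\|} ;
\]

since $Bw' = \theta w' + (\|T_\theta f\| / \|T_{1+\theta} f\|) w$ by (3.7), while $\|w' - \beta w\|^2 = 1 - |\beta|^2$
(by the same calculation as in (4.2)),

\[
Bw'' = \frac{\theta w' + \|T_\theta f\| \, \|T_{1+\theta} f\|^{-1} w - \beta \theta w}{\sqrt{1 - |\beta|^2}} = \theta w'' + \alpha w ,
\]

as claimed. To compute $\alpha$, we recall that $\theta = \lambda/2$, and infer from (4.3) that

\begin{align*}
\|T_\theta f\|^2 &= d(\theta^2 + 1) - 2\theta\lambda = d^2 - 2\theta\lambda = (d-2)^2 , \tag{4.5} \\
\|T_{1+\theta} f\|^2 &= d((1 + \theta)^2 + 1) - 2(1 + \theta)\lambda = (d-2)(d + 2\theta - 2) + d ,
\end{align*}

as well as that

\[
\langle T_{1+\theta} f, T_\theta f \rangle = d(\theta(1 + \theta) + 1) - (2\theta + 1)\lambda = (d-2)(d + \theta - 2) .
\]

We therefore have

\begin{align*}
1 - |\beta|^2 &= 1 - \frac{(d + \theta - 2)^2}{(d-2)(d + 2\theta - 2) + d}
= \frac{(d + \theta - 2)(-\theta) + \theta(d-2) + d}{(d-2)(d + 2\theta - 2) + d} \\
&= \frac{d - \theta^2}{(d-2)(d + 2\theta - 2) + d} = \frac{1}{\|T_{1+\theta} f\|^2} ,
\end{align*}

and so, by (4.4)-(4.5), $\alpha = \|T_\theta f\| = d - 2$. ■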